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Preface 


It is natural to ask what a functional equation is. But there is no easy satisfactory 
answer to this question. While such concepts as element, relation, mapping, opera- 
tion, etc., are well defined in set theory, the principal concept set is an undefined term. 
It is not considered as an impediment for the growth, development, and usefulness 
of set theory. We are going to follow this idea and consider functional equations as 
an undefined concept. This is not going to be a severe roadblock to understanding, 
appreciating, and contributing to the growth and development of this fascinating area. 
As in set theory, we hope the reader will get a general insight of what this theory is 
about. 

Functional equations occur practically everywhere. Their influence and appli- 
cations are felt in every field, and all fields benefit from their contact, use, and 
technique. The growth and development used to be influenced by their spectacu- 
lar application in several areas—not only in mathematics but also in other disci- 
plines. Applications can be found in a wide variety of fields—analysis, applied 
science, behavioural and social science, biology, combinatorics, computers, eco- 
nomics, engineering, geometry, inequalities, information theory, inner product space, 
physics, polynomials, psychology, reproducing scoring system, statistics, taxation, 
etc. Functional equations are being used with vigor in ever-increasing numbers to 
investigate problems in the above-mentioned areas and other fields. 

Even though many eminent mathematicians—including Abel (1823), Banach 
(1920), Cauchy (1821), Darboux (1895), Euler (1768), Ostrowski (1929), Pexider 
(1903), and Poisson (1804)—since the time of d’ Alembert (1769) have contributed 
to this field, no systematic presentation appeared until 1966 [12], 1989 [44], 1992 
[144], and 1985 [558]. To remedy this situation, some books, monographs, and 
survey articles that treat and discuss many aspects and methods of functional equa- 
tions are available. Notable are: 


e J. Aczél, Lectures on Functional Equations and Their Applications [12] 

e J. Aczél and Z. Daréczy, On Measures of Information and Their Characteriza- 
tions [43] 

e J. Aczél and J. Dhombres, Functional Equations in Several Variables [44] 
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e E. Castillo and M.R. Ruiz-Cobo, Functional Equations and Modelling in Science 
and Engineering [144] 

S. Czerwik, Functional Equations and Inequalities in Several Variables [187] 

J. Dhombres, Some Aspects of Functional Equations [218] 

B.R. Ebanks, P.K. Sahoo, and W. Sander, Characterization of Information Mea- 
sures [245] 

W. Eichhorn, Functional Equations in Economics [250] 

M. Ghermanescu, Ecuatti Functionale [315] 

D.H. Hyers, G. Isac, and Th.M. Rassias, Stability of Functional Equations in 
Several Variables [385] 

e S.M. Jung, Hyers-Ulam-Rassias Stability of Functional Equations in Mathema- 
tical Analysis[404] 

Pl. Kannappan, Theory of functional equations [426] 

M. Kuczma, Functional Equations in a Single Variable [554] 

M. Kuczma, An Introduction to the Theory of Functional Equations and Inequali- 
ties: Cauchy’s Equation and Jensen’s Inequality [558] 

M. Kuczma, A survey of the theory of functional equations [552] 

M. Kuczma, B. Choczewski, and R. Ger, Iterative Functional Equations [560] 
Th.M. Rassias, Functional Equations and Inequalities [681] 

Th.M. Rassias, Inner product spaces and applications [683] 

P.K. Sahoo and T. Riedel, Mean Value Theorems and Functional Equations [722] 
J. Smital, On Functions and Functional Equations [757] 

L. Székelyhidi, Convolution Type Functional Equations on Topological Abelian 
Groups [796] 


To highlight the field’s usefulness, applications, and impact on multiple fields 
and growth, we cite extracts from two books [44, 144]. From [44]: 

From their very beginnings, functional equations arose from applications, were 
developed mostly for the sake of applications and, indeed, were applied quite inten- 
sively as soon as they were developed. Such a course of development is not typical 
in all theoretical aspects of mathematics. 

This brings us to the natural question of motivation: To what kind of problems 
can functional equations be applied (and not only within mathematics). Mathemati- 
cians often construct new notions, partly based on older ones, which may come from 
mathematics or from the natural, behavioural and social sciences, for instance from 
physics or economics. Thus the notion of affine vectors evolved in order to represent 
forces, the (homogeneous) linear function to describe proportionality, the logarith- 
mic function to transform geometric sequences into algebraic ones, and the trigono- 
metric functions to determine unknown parts of triangles from known ones. The next 
task of the mathematician is to describe the properties of these new objects, that is, 
to establish their relation to other objects in order to include them in a systematic and 
orderly way into an existing or new theory. ... The vectors, for instance, form the 
foundation of linear algebra, the trigonometric functions that of trigonometry. .. . 

If we are lucky, the properties deduced from the definitions of mathematical 
objects can be quite numerous, even too rich. According to the principle of economy 
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of reasoning and to a legitimate desire for elegance, it is desirable to check whether 
the newly introduced objects are the only ones which have some of the most impor- 
tant properties that we have just established. It is here where functional equations 
may enter. One takes these properties as points of departure and tries to determine 
all objects satisfying them and, in particular, to find conditions under which there 
is unicity. ... This is the fundamental procedure which motivates functional equa- 
tions. We see that is axiomatic in nature. But we see also that the axioms are not 
arbitrarily chosen, we do not generalize just for the sake of generalization. In fact, 
one prefers to take as points of departure those which are considered the most useful 
for applications. 

After such a uniqueness result is established, for instance that concerning the 
equation of the cosine (d’ Alembert’s equation), one also tries to deduce (using the 
equation, not the solutions) the classical properties of the cosine directly. One thus 
arrives at an elegant way of developing trigonometry which, with some changes, can 
also be used to introduce and develop elliptic and, especially, hyperbolic trigono- 
metry. 

As in all parts of mathematics, different authors’ viewpoints and levels of discus- 
sion of functional equations are different. Also, sometimes the methods of reduction 
of one functional equation to another or the proof of their equivalence or discus- 
sion and/or reduction of regularity conditions, extension of domains, etc. give rise 
to further intrinsic developments. This is a further sign of the increasing maturity 
of this field of mathematics. On the other hand, the characterization of functions 
by their equations, as described above, involves many branches of pure and applied 
mathematics in the development of the theory of functional equations. 

From [144]: 

... E. Castillo and A. Fernandez-Cantelli ... were trying to model the influence 
of length and stress range on the fatigue life of longitudinal elements and, when 
analyzing the inconsistencies of some tentative models, ... found a compatibility 
equation which was written in terms of a functional equation and [encountered for 
the first time the field of functional equations]. ... Since then, we have completely 
changed our minds and incorporated the functional equations’ philosophy and tech- 
niques in [our] daily procedures. Even though many years were required to find our 
first functional equation, many functional equations have appeared since then in our 
work, and, in fact, we would not think of building models or stating problems today 
without using functional equations. 

Our experience is that model building in science or engineering is usually per- 
formed based on an arbitrary selection of simple and easily tractable equations that 
seem to reproduce reality to a given quality level. However, on many occasions 
these models exhibit technical failures or inconsistencies, such as those we discov- 
ered in our fatigue models when we got the compatibility equation, which make them 
unacceptable. We have found that functional equations are one of the main tools that 
avoid arbitrariness and allow a rigorous and consistent selection of models. In fact, 
conditions required by many models to be adequate replicas of reality can be written 
as functional equations. 
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Though the theory of functional equations is very old, not only technicians but 
many mathematicians are still unaware of the power of this important field of mathe- 
matics. However, most of the recent advances in the theory of functional equations 
have been published in mathematical journals which are not written in a language 
that many engineers and scientists can easily understand. This fact, which is com- 
mon to many other areas of Mathematics, has been the reason why many engineers 
and applied scientists are still unaware of the long list of these advances and, con- 
sequently, they have not incorporated common functional equation techniques into 
their daily procedures. 

Our experience with functional equations was so positive and relevant for 
applications that we became engrossed in this, not well known, field of Mathema- 
tics. Impressed by its importance and desiring to share with others this discovery, we 
decided to write the present book. 

One of the aims of this book is to provide engineers and applied scientists with 
some selected results of functional equations which can be useful in applications. 
We are aware that this is not an easy task, and that any effort to bring together mathe- 
maticians and engineers, as experience shows, has many associated difficulties. ... 
However, we wish to go even further, trying to change the mentality of the readers 
and offer them a new way of thinking in mathematical modelling. Traditionally, 
engineers and scientists are used to stating practical problems in terms of derivatives 
or integrals, which lead to differential or integral equations, respectively. With this 
book we want to offer them the possibility of using functional equations too, as one 
more alternative which is at least as powerful as either of the other two. 

Applications of functional equations were found much earlier than any system- 
atic presentation could develop and brought about investigations to treat many more 
new functional equations. Hence the development of the theory at first (and to some 
extent now) was closely related to its applications to various branches of mathemat- 
ics and other areas. Now studies for their own sake and not influenced by appli- 
cations are also being pursued vigorously. The large number of papers appearing 
on this subject in various journals indicate the growth of this field and the ever- 
increasing interest of mathematicians and others. Functional equations used to be 
solved by assuming rich properties such as differentiability and reducing to differ- 
ential equations. Certain functional equations can be reduced to integral equations. 
Sometimes it is possible to find the solution of a functional equation on a dense 
set, and sometimes it is possible to reduce the functional equation to another func- 
tional equation that is already known. Hilbert, in 1900, in connection with his fifth 
problem (see Chapter 11), highlighted that, while the theory of differential equa- 
tions provides elegant and powerful techniques for solving functional equations, the 
differentiability assumptions are not inherently required, and so, to solve functional 
equations under general conditions, he stressed the general solutions. The trend now 
adopted by researchers in functional equations is to solve (treat) functional equa- 
tions without any (or mild) regularity assumptions. This effort has given rise to the 
modern theory of functional equations. The theory of functional equations forms 
a modern mathematical discipline that has developed very rapidly in the last four 
decades. Functional equations are studied and solved for their own sake without 
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assuming any regularity conditions. The reader will encounter this throughout the 
book. 

At present, there are exponentially ever-increasing numbers of mathematicians 
(functional equationists) and attractors (from other fields) from more than thirty 
countries. Researchers from Australia, Austria, Canada, China, the Czech Republic, 
Denmark, France, Germany, Greece, Hungary, India, Italy, Japan, Korea, Morocco, 
Poland, Romania, Russia, Spain, Switzerland, the United States, and others are 
engaged vigorously and passionately in studying functional equations and their ap- 
plications and determining their solutions. Several schools at Waterloo, Katowice, 
Krakow, Graz, and Debrecen actively pursue and contribute to the study and growth 
of functional equations. Primary journals catering to, contributing to, influencing, 
and encouraging this area are: Aequationes Mathematicae (from 1968), Publica- 
tiones Mathematicae Debrecen (from 1953), Glasnik Mathematicki, Annales Math- 
ematicae Silesianae, Demonstratio Mathematica, Reznik Nauk Dydakt, Prace, Cra- 
cowiensis, and Results in Mathematics. Annual international symposia have been 
held in Canada, various European countries, and the United States. with the num- 
ber of participants increasing to new heights. Periodic special conferences organized 
in Poland and Hungary devoted to functional equations and inequalities contribute 
greatly to the development of this area and attract new researchers. Aequationes 
Mathematicae publishes the reports of the meetings and exhaustive bibliographies 
on a regular basis. Professor Janos Aczél has been instrumental in these and the 
annual symposia, thereby promoting and spreading information and attracting and 
recruiting new researchers. These meetings help individuals to come into contact 
with and get to know many researchers in this field. 

These developments are sources of inspiration, attraction, and influence to bring 
more and more people to this interesting and ever-growing branch of mathematics, 
who in turn will enrich its study and scope. Hopefully interest in this field will 
continue forever. 

This book has many new features in addition to the usual expected ones. Its aim 
is to introduce and cover as many important equations, areas, and methods of solu- 
tion as possible. Some of the proofs follow the same lines as in other books, some 
are modified, some are direct and simple, and some are presented with new elegant 
proofs. Different methods of proof are presented. I have exploited associativity in 
many places. No general method of solution is possible. No one or dozen methods 
are enough to tackle functional equations, which could have been a discouragement 
to the study of solving them. Hopefully this book will remedy this problem and 
encourage many new workers to come study and develop this field. 

The literature in this field is voluminous beyond imagination. It is impossible 
to include all the references (even some relevant ones). The list is getting longer 
by the day. I had the difficult and unpalatable task of choosing better representative 
(encyclopedic) references. No offense is intended to anyone if their works are not 
cited. The reader is encouraged to refer to as many of the works as possible—both 
the books mentioned previously and the references in the bibliography. Many results 
have been discovered and rediscovered several times. This book, like the others 
before it, will remedy this to some extent. For historical details of many of the 
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equations treated in this book, refer to J. Aczél and J. Dhombres [44]. Writing this 
book gave me an opportunity to listen, to learn, to better understand, and to educate 
myself in this fascinating area. My motivation is to give the reader a bird’s eye view 
of what this field is about, its development, and its achievements by focusing on 
examples chosen from diverse areas and applications. 

Basic facts of general and linear algebra and calculus (analysis) are presumed. 
We do not distinguish functional equations of one and several variables. 
We emphasize that we deal with functional equations with many unknown func- 
tions and determine their general solutions. The reader will encounter this in many 
places. Determining several unknown functions in one equation is a salient feature 
unique to functional equations. This book does not treat existence and uniqueness 
results, iteration theory, operator equations, difference equations, etc. The book has 
been written by drawing on a wealth of information from many publications—from 
other books, journals, and papers from different sources. 


The Organization of this Book 


This book contains seventeen chapters. The first chapter, as usual, contains the 
basic equations of Cauchy and Pexider and many of their generalizations, exten- 
sions, and direct applications. This chapter serves as a foundation for other chapters. 
Chapters 3 and 10 are devoted to trigonometric functional equations and functional 
equations from information theory and treat these subjects in greater detail than the 
other chapters. Chapter 4 is devoted entirely to the quadratic functional equation 
and its generalizations. It is followed in Chapter 5 by its connected interest, inner 
product spaces, highlighting functional equations’ connection to i.p.s. Chapter 7 is 
devoted to the characterization of polynomials, Chapter 8 to nondifferentiable func- 
tions, and Chapter 9 to the characterization of groups, loops, and closure condi- 
tions, bringing out Thomsen and Reidemeister closure conditions, Bol and Moufang 
identities, and their functional equation generalizations. Chapter 11, Abel Equa- 
tions and Generalizations, is devoted to Hilbert’s fifth problem and its connection to 
information theory. A surprising link between the functional equation of Abel and 
the well-known fundamental equation of information (FEI) is another special fea- 
ture. Chapter 2, Matrix Equations, is brief. Chapter 6, Stability, is also brief since 
many monographs and books [385, 404] are already available. Chapter 12, Regu- 
larity Conditions—Christensen Measurability, covers this material for the first time 
in book form. Chapter 13, Difference Equations, is devoted to the familiar Cauchy 
and Pexider differences. Chapter 14 covers the characterization of special functions 
such as gamma and beta functions. Chapter 15, Miscellaneous Equations, is devoted 
to many methods and approaches to determine solutions of equations and will be of 
interest. Chapter 16, General Inequalities, is also an interesting chapter, including 
both applications and general discussion. Chapter 17, devoted to multiple applica- 
tions in various fields that are not already covered in the other chapters, will also be 
of great interest. 

For easy reference and convenience, there are sections at the end of the book on 
notation and symbols, an author index, a subject index, and a large bibliography that 
will prove helpful. 
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Regarding the general use of specific letters, A is additive, B is biadditive, D is 
derivation, EF is exponential, and L is logarithmic. 

Flashback: When I went to the University of Washington, Seattle, in the early 
1960’s, my intention was to work in measure theory, and Professor Edwin Hewitt 
was on sabbatical in Australia at that time. Surprisingly after some time, I received 
a note from him directing me to look at Flett’s [280] paper on cosine equations and 
see whether I could generalize the result. What I did not realize at that time was that 
I was hooked on functional equations, which I do not regret. My first encounter with 
functional equations and functional equationists came in 1966 at the Oberwolfach 
meeting. There were fifteen to twenty participants including Aczél, Barner, Benz, 
Eichhorn, Haupt, and Ostrowski, and the first speaker was Kuczma (in German). 
Ever since, the functional equation symposium has met regularly on an annual basis 
with few exceptions. I have had the opportunity to get to know personally an ever- 
increasing number of participants. Interest has grown many-fold, and the number of 
participants at the 2005 meeting was between 100 and 150. I do not see any letup in 
the activities of the researchers, and I have no doubt that such activity will continue 
in the years to come since it is a passion for many of the participants. 

Professor Michiel Hazewinkel (CWI—Amsterdam), who is the managing editor 
of the principal mathematics series Mathematics and Its Applications of Kluwer Aca- 
demic Publishers now Springer, invited me to write a book on functional equations 
which might be appropriate for Springer and expected this would play a part in their 
expansion. I accepted promptly with pleasure and full of hope. My first instinct was 
collaboration, and I enlisted two colleagues. I realized after some time it was not 
moving well. Without losing heart, I tried and got two new colleagues for the joint 
venture. Alas, it did not materialize either and also ended in disappointment. It took 
some time for me to realize that the joint work idea was not working and not feasi- 
ble, and I resolved not to give up and to go it alone and honour my commitment to 
Professor Hazewinkel. The result was that writing this book took an unexpectedly 
long time. In the meantime, there was a change in the publisher to Springer. 

At the time of my retirement, the chairman of my department, who was aware 
I was writing a book, said that continued secretarial and other assistance would be 
provided. But the person who succeeded him in a few months time had different 
ideas! He said that no secretarial assistance would be provided to typeset the book 
and one day unceremoniously left the manuscripts with the secretary to put in my 
mailbox. What can I say about this immature, thoughtless behaviour? 

Usually one acknowledges support from granting agencies. Here it is different, 
and I present a couple of NSERC of Canada referee reports to highlight the biased 
considerations: “My area of expertise is not close to that of the proposal”; “I do not 
believe that functional equations is a desirable area for training, on the positive side it 
is clear that a knowledge of functional equations can be applied to a variety of areas. 
Caveat: I am not expert in this side of functional equations. My views of necessity 
are those of an interested outsider’; “In other areas of the proposal where the main 
objective appears to be writing a book or monograph or survey paper, it is difficult 
to measure the scope. While there is some merit to writing a book or survey paper, it 
would have been more desirable to have a greater focus put on finding solutions ...”’; 
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“The Committee did not note any improvement in the applicant’s research record 
since the last decision” (sweeping statement); “Some items of the research program 
are quite original”; “according to my experience this type of method requires a lot of 


29, 66. 29, 66. 


ingeniousness”; “the applicant is an expert in the proposed area”; “these results are 


29, 66 


significant contributions”; “certainly one of the leading figures in this area’; “of the 
five NSERC applicants I assessed this year, I would rank him in the top two”; “as 
also noted, Professor Kannappan’s work has been well cited”; “Professor Kannappan 
maintains a strong rhythm of publications on various areas of functional equations”; 
“such a book from Professor Kannappan should be strongly encouraged”; “the ref- 


29, 66 


erees’ comments on the applicant were almost uniformly positive’; “originality and 
innovation”; “good”. These comments had no effect or were ignored by NSERC of 
Canada. Is there a redemption for NSERC? 

In spite of all these impediments, frustrations, and disappointments, I was not 
discouraged, nor did I give up or lose sight of my goal. I worked consistently to 
accomplish the final goal—writing this book. I take responsibility for both the credits 
and defects of this book. 

It is my pleasure to express my deepest appreciation to the untiring Lis D’ Alessio 
for her patience, understanding, cheerful attitude, and excellent typing. I am truly 
grateful for her sustained help over the years to make it possible to accomplish this 
task. No words can adequately describe her work throughout the entire period of 
writing this book. 

I thank Springer for bringing out this book elegantly and for their patience. 

Spread the message of functional equations. Your comments on all aspects of the 
book are welcome. 


Pl. Kannappan 
Waterloo, 2008 
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Basic Equations: Cauchy and Pexider Equations 


In this chapter, Cauchy’s fundamental equations are studied, some algebraic con- 
ditions and generalizations are considered, and alternate equations, conditions on 
restricted domains, Jensen’s equation, some special cases of Cauchy’s equations, 
extensions of the additive equations, Pexider equations and their extensions, some 
applications in economics, the allocation problem, and sums of powers of first n 
natural numbers are treated. 

There is a saying in Tamil, “paruppilla kalyanama”—marriage without lentils. 
(In the North American context, can one think of Thanksgiving without turkey?) 
Similarly, can one say “a book on functional equations without Cauchy’s equations?” 
As Cauchy’s equations permeate functional equations and are fundamental in nature, 
we first treat the Cauchy equations 


(A) A(x + y) = A(x) + AQ) (additive), 

(E) E(x+y)= E(x*)EQ) (exponential), 
(L) L(xy) = L(x) + LO) (logarithmic), 
(M) M(xy) = M(x)M(y) (multiplicative), 


treated systematically in Cauchy [147], Aczél [12], and Kannappan [458]. Even 
though these equations occur earlier in the studies by Oresme [655, 656, 44], Euler 
[262, 263], Gauss [303], Stifel [777], Briggs [134], Biirgi [139], Kepler [531], 
Lacroix [592], Legendre [605], de Sarasa [727], etc., in the literature, these equa- 
tions are known as Cauchy’s equations. The equation (A) finds applications in almost 
every branch of mathematics. It appears in the problem of the parallelogram law of 
forces, in the problem of the measurement of areas, in the problem of simple interest, 
in projective geometry, in mechanics, in the theory of probability, in non-Euclidean 
geometry, etc. Cauchy’s equations are used in the mathematics of finance, in adver- 
tising, in production theory, in economics, in taxation, in modelling, in probability 
theory, and in many other areas. The equation (A) is one of the equations that has 
been extensively studied or explored and was solved by, among numerous authors, 
Aczél, Alexiewicz and Orlicz, Banach, Darboux, Erdés, Fréchet, Gauss, Halperin, 
Hamel, Hewitt and Zuckerman, Kuczma, Kurepa, Kuwagaki, Legendre, Ostrowski, 
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and Sierpinski, under various hypotheses of the function, domain, and range. We will 
deal with them in detail later. The existence of discontinuous solutions of (A), due to 
Hamel [339], is based on the axiom of choice. Where the domain and range of (A) are 
abstract sets (groupoids, groups, fields, etc.), these equations play an important role 
as the equations of homomorphism, endomorphism, isomorphism, automorphism, 
etc. The Cauchy equations play a prominent role in the theory of functional equa- 
tions and have fascinated investigators. Researchers fell in love with these equations, 
and the romance will continue and will result in many more interesting results. These 
equations are the subject of study under various contexts. 


1.1 Additive Equations 
First we consider the (Cauchy) additive equation (A) 
(A) AQ + y) = AQ)+A(Qy). 


We begin with the domain and range of A being reals. Let A : R — R satisfy the 
equation (A). Then A is said to be additive. 
Letting x = 0, y = 0 in (A), we have 


(1.1) A(0) = 0. 
Now, changing y into —x in (A) and using (1.1), we get 
(1.2) A(—x) = —A(x) forall x €R; 
that is, A is odd. From (A) and by finite induction, it follows that 

AQ + x2 +--+ + Xn) = AQ1) + A(x) + +++ + AGn). 
Setting x; = x (j = 1,2,...,m) in the above and using (1.2), one gets 
(1.3) A(nx) =nA(x) forx €R, n€ Z. 


Let n(¥ 0) be in Z. Replace x by = in (1.3) to obtain 
1 
(1.4) A (=) = A(x) forx ER, neZ*. 
n n 
Letr = a be any rational. Then (1.3) and (1.4) yield 
(1.5) A(rx) =A (m ~) =mA (-) =" AG) =rAG@) 
n n n 


for x € R, r € Q. That is, A is rational homogeneous. 
With x = 1, (1.5) becomes 


(1.6) A(r)=cr, forreéeQ, 


for some constant c = A(1). So far only the condition that A is additive is used. 
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Now, if we assume that A is continuous or continuous at a point, we can conclude 
that 


(1.7) A(x) =cx forallx ER. 


Indeed, from the continuity and (1.6), (1.7) follows easily since Q is dense in R. 
On the other hand, suppose A is continuous at a point x9 € R and yo is an arbitrary 
real number. Let y, — yo, where y, € R. Then y,— yo+xo9 — xo and the use of (A) 
and the continuity at xo yield A(y, — yo + x0) — A(xo); that is, AQyn) > AQ), 
showing that A is continuous at yo. Hence A is continuous on R and (1.7) holds. 
Thus, we have proved the following theorem. 


Theorem 1.1. Let A be a real-valued function of a real variable satisfying the addi- 
tive equation (A). Then, if A is either continuous or continuous at a point, it has the 


form (1.7). 
Remark 1.1a. If A : R — R is a nonzero solution of (A), then A is unbounded. 


There are as many conditions known in the literature as one can imagine for the 
solution of (A) to be (1.7) and thus continuous. The condition of continuity can 
be weakened considerably in (A) to obtain the same conclusion. Several equivalent 
conditions for continuity and proofs are known in the literature (Aczél [12], Aczél 
and Dhombres [44], Kannappan [426], and Kuczma [552, 558]). In the next few 
theorems, we achieve the goal of continuity. 


Theorem 1.2. Suppose A : R — R satisfies (A) with c = A(1) > 0. Then the 
following conditions are equivalent: 


(i) A is continuous at a point xo. 
(ii) A is monotonically increasing. 
(iii) A is nonnegative for nonnegative x. 
(iv) A is bounded above on a finite interval. 
(v) A is bounded below on a finite interval. 
(vi) A is bounded above (below) on a bounded set of positive Lebesgue measure. 
(vii) A is bounded on a bounded set of positive measure (Lebesgue). 
(viii) A is bounded on a finite interval. 
(ix) A(x) = cx. 
(x) A is locally Lebesgue integrable. 
(xi) A is differentiable. 
(xii) A is Lebesgue measurable. 


Proof. We will prove the implications in the same order as stated (also see Aczél 
[12] and Kannappan [426]). 


To prove that (i) ==> (ii), let x > y. Let {r,} be a sequence of rationals such that 
tin > xX -—yasn > ow. Thenr, + y-—x+x0 > x9 asin — ov. By (i) and 
the additivity of A, one gets A(r, + y — x + x0) > A(Xo); that is, cr, + A(y) — 
A(x) > Oasn —> oo. But cr, + A(y) — A(x) > cw — y) + A(y) — A(x); that 
is, A(x) — A(y) + c(y — x) = 0, which shows that A(x) — A(y) = c(x — y) > 0 
(recall c = A(1) > 0). Hence A is monotonically increasing, which proves (ii). 
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To show that (ii) ==> (iii), suppose x > 0. Then, by (ii), A(x) => A(O) = 0, which is 
(iii). 

Remark 1.3. Note that, since A is odd (it satisfies (1.2)), A is nonpositive for non- 
positive x. 


To prove that (iii) => (iv), suppose x € [a, b], a finite interval. Then b — x > 0. 
By (iii) and the additivity of A, A(b) => A(x); that is, A is bounded above on [a, b] 
by A(b). 
To show that (iv) = >(v), let [c, d] be any finite interval and x € [c,d]. Now A is 
bounded above on [c — x,d — x] by (iv); that is, A(c — x) < m for some m or 
A(x) => A(c) — m, which shows that A is bounded below on [c, d]. 
To prove that (v) ==> (vi), let us prove that A is bounded above on a bounded set F of 
positive Lebesgue measure. There exists a finite interval [a, b] such that F C [a, b]. 
Then, for any x € [a,b], consider [a — x, b — x]. By (v), A is bounded below so 
that A(a — x) > m or A(x) < A(a) — m for some m; that is, A is bounded above on 
[a, b]. Hence A is bounded above on F’.. Note that A is bounded on F. 
To show that (vi)= => (vii), suppose A is bounded below on a bounded set F of 
positive measure. It is well known that F + F has a nonempty interior (see [566, 
578, 567, 766, 533, 696, 742, 44]). Then there is an interval [c,d] C F + F, and A 
is bounded below on [c, d]. As in the proof of (v) ==> (vi), it is easy to show that A 
is bounded on F. This proves (vii). 
Remark 1.3a. P. Erdés observed that A may be bounded on a set C for which C —C 
contains an interval and that A is not continuous. 
To prove that (vii) ==> (vili), let |A(x)| < m for x € F, a set of positive measure. 
By a theorem of Steinhaus [766], there is a positive number 6 such that every real 
number ¢ satisfying |t| < 6 may be expressed as x — y for suitable x, y € F. Then 
|A(t)| = |A@ —y)| < 2m for t € ]—6, o[. Thus A is bounded on | — 6, o[ and hence 
on any finite interval. This proves (viii). 

Also note that a finite interval is a bounded set of positive measure. 
To show that (viii) => (ix) (Aczél [12], Darboux [190]), suppose A is bounded on 
[a, b]. Define 


(1.8) d(x) =A(x)—cx forx ER. 


Then ¢ is also additive on R, ¢(r) = 0, forr € Q, and ¢ is bounded on [a, b]. 
Furthermore, 


(1.9) d(x+r)= (x) forx ER, reQ. 


Since for every real x we can find a rational r such that x +r € [a,b], we can 
conclude from (1.9) that ¢ is bounded everywhere on R. Now claim that d(x) = 0 
for all x € R. If not, there is an x9 such that d(x9) = b(# 0) > 0. Then d(nxo0) = nb 
holds for all n € Z, so, for arbitrarily large n, ¢ can take an arbitrarily large value, 
contradicting the boundedness of ¢. Thus A(x) = cx; that is, (ix) holds. 

(ix) ==>(x) is obvious. 
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To show that (x) ==> (xi), integrate (A) over [a, b] to obtain 


[ac 4 ay = | acay+ | aovay 


a a a 


that is, 


b+x 
| A(vo)dv = (b —a)A(x) +d. 
a+x 

Since A is locally integrable, the equation above shows first that A is continuous 
and then indeed A is differentiable. This proves (xi). 
(xi) implies (xi1) is obvious. 
Finally, to complete the cycle, we have to prove that (xii) ==> (i). To prove that 
every measurable solution of (A) is continuous (that is, of the form (1.7)), a number 
of proofs of this result are known (Sierpinski [740], Alexiewicz and Orlicz [69]). 
Here we present the proof due to Banach [96]. 


Proof. Let x9 € R, ¢ > 0, and Ja, b[ an arbitrary interval. By a theorem of Lusin, 


there exists for every measurable function f, (in particular here A) and for every 


o > 0 (in particular, ¢ = pea) a continuous function F (for all real x) such that 


A(x) = F(x) 


holds for all x € Ja, b[, except perhaps for x’s forming a set C of measure < o. The 
function F being continuous, for every ¢ > O there exists a 0 (< o) such that, for 
x €ja,bl, |F(«+h)—F(«)| < ¢ whenever |h| < 6. Since A(x) = F(x) fore Ja, b[ 
except over a set C of measure < o, we can conclude that A(x +h) = F(x +h) 
is true for all x € Ja, b[ except over a set C; of measure < o + |h| < o + 6. The 
set of x € Ja, b[ for which neither A(x) = F(x) nor Aw +h) = F(x +h) hold 
is therefore of measure < m(C UC) < 20 +6 < 3a < b—a, where m is the 
Lebesgue measure on R. Hence there is a point x € Ja, b[ (dependent on /) for which 
A(x) = F(x), Aw +h) = F@ +h), and |F(x +h) — F(x)| < € are valid. Hence 
we have |A(x +h) — A(x)| < e. Since Aw +h) — A(x) = A(xo +/) — A(X0), for 
any real x9, |A(xo +) — A(xo)| < e. Hence A is continuous. This proves (i). 


Corollary 1.4. [f A satisfies (A) on R and is measurable on some subset C of positive 
Lebesgue measure, then A is continuous. 


Proof. Because the set of x € C for which |A(x)| < 7 has positive measure for 
sufficiently large n, the proof follows by Theorem 1.2 (vii). 


Corollary 1.5. Let A : R > R satisfy (A). Suppose there is a subset C of positive 
Lebesgue measure on which A is bounded above by a Lebesgue measurable function 
f :R—R. Then A is continuous. 


Proof. (Dhombres [218]). Since f : C — R is Lebesgue measurable and the 
measure of C is positive, there is an integer k such that Ck, = {x € C: f(x) < k}is 
of positive measure. So, there exists a subset Cx of positive measure on which A is 
bounded above. Thus, by (vi) of Theorem 1.2, A is continuous. 
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1.1.1 Discontinuous Solutions 


Whereas the solution of (A) under the hypothesis of continuity is of the form (1.7), 
Hamel [339] bases are used successfully in constructing a function A (using the 
axiom of choice), which is a solution of (A) that is not of the form (1.7) and so is not 
continuous (see also Aczél and Daréczy [43], Aczél [12], Kuczma [566]). 

If A is not continuous, how does it look? Obviously, any discontinuous solution 
of (A) on an interval is unbounded (see (viii) of Theorem 1.2). First we construct the 
most general solution of (A). Then we show the existence of discontinuous solutions 
of (A). As a matter of fact, we prove the following theorem. 


Theorem 1.6. Let H be a Hamel basis of Rand f be defined arbitrarily on H. Then 
there exists an A: R > R that is additive and A|H = f. 


Proof. Let H be a Hamel basis of R over Q rationals. Let f be defined arbitrarily on 
H. Every real number x can be represented uniquely as a finite linear combination 


m 


. rete, re € QO, hy € A. 
k=1 


Define A : R > R by 


m 
A(x) = Do ref (hk). 
k=1 

Clearly this A is a solution of (A), and every solution of (A) can be written in this 
form. 

Obviously, A|H = f. It is easy to check that such an A is continuous if and only 
if 

f(A) = chy, forh, € A, 

for some constant c. 

In order to obtain a discontinuous solution of (A) or solutions that are not of the 
form (1.7), choose two Hamel basis elements h;, 2 in H such that 


fairy/hi F f(ha)/ha. 


Such solutions are nowhere continuous (refer to Theorems 1.1 and 1.2). 
This proves the theorem. 


Theorem 1.6a. (Aczél [44], Eichhorn [250]). Let the real function A satisfy (A) and 
not be continuous. Then the graph of A = {(x, A(x))|x € R} is dense in R?. 


Proof. Pick t; # 0. Since A is not continuous, there is a t2 4 O such that 


ae # a (otherwise A(t2) = cfz for all t2 € R). This means (f;, A(t,)) and 


(t2, A(t2)) are linearly independent and form a basis for R*. Let (x, y) € R*. Then 
there exist constants c; and cz in R such that (x, y) = c(t), A(t1)) + c2(f2, A(h)). 
Choose rationals 7; and r2 as close to c; and c2 (respectively) as desired. Then 
(rit) + rote, A(t) + r2t2)) = (rity + rote, A(t) + r2A(t2)) is as close to 
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(cit) + cote2,c,A(,) + c2A(f2)) as desired; that is, an element on the graph of A 
is as close to (x, y) as desired. So, the graph of A is dense in R?. 


Remark 1.6b. These results reveal that solutions of (A) are either very regular or 
extremely pathological. 


Corollary 1.7. Let A be a noncontinuous solution of (A). Then the image of any 
open interval by A is dense in R. 


Proof. Let |a, b[ be any open interval. Suppose A (Ja, b[) the image of Ja, b[ by A, 
is not dense in R. Then there is an open interval |c, d[ that has void intersection with 
A(]a, b[). Now the open set C =a, b[ x Jc, d[ in R? has no point of the graph of 
(A), contrary to Theorem 1.6a. 


1.2 Algebraic Conditions—Derivation 


Theorem 1.2 shows that a solution of (A) under some regularity condition is continu- 
ous and is of the form (1.7). The question now arises as to whether one can look for 
supplementary algebraic conditions of A implying the continuity of A. This may 
lead to many interesting problems and representations of A (see Problem 1.16). First 
we deal with questions raised by Halperin [335] and Rothenberg [712]. 


Problem 1.8. (I. Halperin [335]). Suppose A : R — R is additive and satisfies 


(1.10) A (<) = + Aw), x(£0) ER. 
x Xx 


Does it follow that A is continuous? 


The answer turns out to be yes. Several proofs are known (Jurkat [406], Kurepa 
[584, 585, 586],Kannappan [426]). Here we present a different proof. 
For x 40,1, —-1, 


1 1\7 x 
a( _ ~) = (« — ~) A (4) (use (1.10)) 
x x x-—] 


_@- ys 1 1 A 
= a(t) (use (A)) 
2 _ 2 
2x 2x2 
2 
exe as ee ah (use (A)) 
x x 


=A(x)—A (<) = A(x) — TAGs. 
x x* 


Hence A(x) = A(1)x, forx 4 0,1, —1. Evidently, this relation holds for x = 0, 
1, —1 also. Thus A is continuous. 
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Derivation 
A mapping D : R > Ris called a derivation provided D satisfies (A) and 


(D) D(xy)=xD(y)+ yD), forallx,y eR. 


Proposition 1.9. Suppose D : R > R is additive, that is, D satisfies (A). Then the 
following are equivalent: 

(i) D satisfies (D). 

(ii) D (+) =-4D(x) forx £0. 
(iii) D(x”) = 2x D(x) forx € R. 

(iv) D(x”)? = 4x?D(x)?. 
Proof. We prove in the order listed above and take D 4 0. 
To prove that (1) ==> (it), first of all, notice that D(1) = 0. The y = + in (D) gives 
(ii). 
To show that (ii) ==> (iii), again note that D(1) = 0. For x 4 +1, using (A) and (ii), 
we have 


2) _ 2 _ 242 1 _ (x2—1)° 1 1 
D(x") = D(x*—1) = —(x*—-1) (4) ate (—-—) 
_ @+1)? (x —1) 
are eer ae a 


2 
D(x —1)- D(x +1)=2xD(x) forx #41. 


Obviously, D(x”) = 2x D(x) holds for x = +1 also. So, (iii) is true. 
(iii) implying (iv) is obvious. 
To complete the cycle, we will show that (iv) ==> (i), first note that D(1) = O and 
so D(r) = 0 forr € Q. Change x to x +r, where r is a rational in (iv), and use D 
additive, D(r) = 0, and (iv) to get D(x”) = 2x D(x) (which is (iii)). For x, y € R, 
use (A) and (iii) to get 


D(x + y)?) = 2 + y)D@ + y) = 2 + y)(D&) + DY) 
= D(x”) + 2D(xy) + D(y”); 


that is, (i) is true. This proves the proposition. 


Remark 1.10. If D is a derivation, then D(r) = 0 for all r € Q. If a derivation D 
is continuous, then D = 0. There exist nontrivial derivations, and derivations can be 
defined on abstract structures such as rings, and fields (see Zariski and Samuel [839], 
Kuczma [558], and Aczél and Dhombres [44]). 


Problem 1.11. (Rothenberg [712]). Suppose the real function A is additive with 


(1.11) A(x)A (-) >0 forx(#0)ER. 
Xx 


Is A continuous? 
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The answer turns out to be no this time. First we consider a stronger condition, 
(1.12), than (1.11) ([445, 30]), 


(1.12) A(x)A (-) =b(>0) forx £0. 
x 


Observe that A is not constant and that A(1)? = b. Then, from (A) and (1.12), for 
x #1,—1, we obtain 


1 1 1 1 1 
A =-A —-A : 
x2-—1 2 x—-1 2 x+1 


oe : 7 
A(x?)— A(I)-2(A@®)— AM) -2(A@®) +. A)” 


that is, 


which yields A(x7)A(1) = A(x)?, so that A(x) > 0 for x > 0 if A(1) > 0 
(or A(x) < 0 for x > Oif A(1) < 0). In either case, A is continuous (see (iii) 
of Theorem 1.2). Then A(x) = Vbx or A(x) = —Vbx, according to whether 
A(1) > Oor A(1) < 0. 

Now we present the solution due to Benz [117]. Consider a noncontinuous auto- 
morphism ¢ : C —> C, and put A(x) = Re ¢(x) for all x € R. Then A is additive 
on R and A is discontinuous. Moreover, 


1\ _ A(x) 
a(-)- po forx € R, x £0, 


where g(x) = Im ¢(x) (use ¢(x)d(4+) = 1). Hence, 1 > A(x)A(4) = 0 for all 
x (€ 0) in R. Benz ends this remark with a comment: If there exists a discontinuous 
automorphism ¢ of C such that d(x) ¢ iR for all x € R*, then this automorphism 


would lead to a counterexample 1 > A(x)A + > 0 forall x € R*. 


Bergman [120] has constructed such an automorphism ¢ : C > C. 


Proposition 1.12. Let A : R > R be additive and satisfy 


(1.13) A(xy)=A(x)A(Qy) forx,yeER. 


Then A is continuous and either A = 0 or A(x) = x; that is, if A is additive and 
multiplicative, then A = 0 or is an identity map. 


Proof. Put y = x in (1.13) to get 
(1.14) A(x’) = A(x)? forx ER; 


that is, A(x) > O for all x > 0. Since A is also additive, by (iii) of Theorem 1.2, A is 
continuous and A(x) = cx. This A in (1.13) gives c = c?; that is, c = 0 or 1. This 
proves the proposition. 
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Remark 1.13. [Aczél [12], Aczél and Dhombres [44], Kannappan [426]]. All auto- 
morphisms A : R > R are given by A(x) = x. 


Proposition 1.14. Let A : R — R satisfy (A) and 
(1.15) A(x”) = A(x)" forx ER, 


where m is an integer different from 0 and 1. Then A is continuous and either 
A=0, A(x) =x, or A(x) = —x. 


Proof. (See also [218].) Note that (1.14) is a special case of (1.15) and that A = 0 is 
a solution of (1.15). So, we look for a not identically zero solution A. 

First let us consider the case where m is a positive, even integer. Then A(x) > 0 
for x > 0. Thus, by (iii) of Theorem 1.2, A(x) = cx withc = c”. Thenc = Oor 1, 
giving the solution A(x) = x. 

Assume now that m is a positive, odd integer. Choose any rational r (4 0) and 
replace x by x +r in (1.15) to obtain 

m 
1 


[A(x) + A()r]” = A(x)” + ( )aco”"acnr +--+ A(1)"r™ 
= A(x +1") 


= A(x”) + (T)aamby +etr™A(I). 


Treating the right sides as a polynomial in r and equating the coefficients of r, we 
have A(1)A(x)”~! = A(x”7!). Since m is odd, A(x)”~! > 0. Hence, if A(1) > 0, 
then A(x) > 0, andif A(1) < 0, then A(x) < O for x > 0. In either case, A is 
continuous by (iii) of Theorem 1.2 and A(x) = cx with c = c”™. Since m is odd, 
c = 0, 1, or —1, yielding the solutions A(x) = x and A(x) = —x. 

Finally, let us assume that m is a negative integer different from —1. Ifm = —1, 
then (1.15) gives A(x)A(4) = 1, a special case of (1.12), and A(x) = x or A(x) = 
—x (see Problem 1.11). 

Now, by (1.15), A(x") = A(x)", m? being a positive integer different from 1. 
By the previous case, A(x) = x if m? (and thus m) is even or A(x) = —x if m? (that 
is, m) 1s odd. This proves the proposition. 


Proposition 1.15. (Kannappan and Kurepa [486], Kannappan [458]). Suppose that 
A:R-— Ris additive and 


(1.16) A(x") =x"""A(x""), forx ER, 


holds for all nonzero integers m andn, m # n. Then again A is continuous and 


A(x) = A(Q1)x. 


Proof. Without loss of generality, take n > m. If n < m, we can consider A(x”) = 
KHPA (x): 
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Replace x by x +r, where r is a nonzero rational in (1.16), to have 


A (» + (ter of actia fs (, a an +r) 
=A Ga + (Gea feeeH (," jee ne i") 
ee + (" F oak corer * ( n—-m = ae Pa ; 
n—-m— 


Treating the equation above as a polynomial in r and equating the coefficient of r 
on both sides, we obtain A(x) = A(1)x. This proves the proposition. 


n-1 


Remark 1.15a. Forn = 1, m = —1, (1.16) becomes the Halperin Problem 
(Problem 1.8). 


1.2.1 More Algebraic Conditions 


Now we treat the following more general (equation) problem. Let V be the set of 
all additive functions A : R — R. Then V is a vector space over R. Let U be the 
subspace of V spanned by x — A(x) = A(1)x, continuous functions, and by all 
derivations on R. 


Problem 1.16. (Kannappan and Kurepa [486], Kannappan [458]). Let u;’s be ratio- 
nal functions in x, the p;’s be continuous functions on R except at the singular points 
of u;, and the A;’s be additive functions. When does a condition of the form 


(1.17) > pilx)AiWie)) =0, forx ER, 
i=l 


imply that A; ¢ U @ = 1,2,...,n); that is, that the A;’s are linearly dependent 
relative to U? 


This problem has not been solved completely. Here we present a result from 
[486] that includes Halperin’s Problem (Problem 1.8) and Proposition 1.15 as special 
cases. We will start with a lemma. 


Lemma 1.17. An additive D : R > R is a derivation if, and only if, D satisfies 


(1.18) D(x?") = —nx"*!D(x""!) 4 (n+ I)x"D(x"), forall x ER, 


with D(r) = 0, forr € Q, where n is a positive integer. 


Proof. When D is a derivation, it is easy to check (1.18). Suppose (1.18) holds. Note 
that, when n = 1, (1.18) is (iii) in Proposition 1.9. 
Replace x by x +r, (r € Q) in (1.18) to obtain 


D(x +r)*") = —n(r +x)" D((r +.x)""') + 24+ D+ x)"D(x+7)"). 
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Noting that D is additive, D(r) = 0, and the equation above is a polynomial in 7, 
equate the coefficient of r2”~? to get 


2n(2n — 1) n [= 1)(n — 2) 


5 D(x?) =— 5 De?) ++ 1)n—- xD] 
+ (n+ 1) [ew + xD] : 


which is D(x”) = 2x D(x). By (iii) of Proposition 1.9, D is a derivation. 


Theorem 1.18. (Kannappan and Kurepa [486]). Let Ai (4 0) and Az be additive 
functions from R into R and satisfy 


(1.19) Ai(x") = P(x)A2(x”) 


for all x #4 0, with P : R* — R as a continuous function with P(1) = 1 and 
m and n nonzero integers with m #4 n. Then P(x) = x"~". Further, if Di(x) = 
Aj(x) — Aj(1)x fori = 1,2, then Dj are derivations and nD, (x) = mDo(x). 


Conversely, if D; and D2 are derivations on R and Aj(x) = bx + D,(x), 
A2(x) = bx + D(x), where b is any real number, mD2(x) = nD (x), and 
P(x) =x", where m,n € Z, then Aj, Az, and P satisfy (1.19) for all x € R. 


Proof. It is easy to verify the converse part. First we shall show that P(x) = x”~". 
Replacing x by rx, r any rational in (1.19), and using (1.19), we have 


[7 " P(rx) — P(x)JA2(x") = 0. 


Since Az 4 0 and P is continuous with P(1) = 1, the equation above yields P(x) = 
x"~" | Now (1.19) can be rewritten as 


(1.20) Ai(x") = x" Ao(x™), x £0. 


Let Dj(x) = Aj(x) — Aj(1)x for x € R, i = 1,2. Observe that A; (1) = A2(1). 
First we consider the casen Am, n,m > 0. 

We subdivide this case into three subcases: m = 1, n = 2; m = 1, n > 2; and 
m>2,n>m. 


Case m = 1, n = 2. Now (1.20) becomes 

(1.21) Ai (x?) =xA2(x) forx £0. 
Replacing x by x + y in (1.21) and utilizing (1.21), we have 

(1.22) 2Ai(xy) = xA2(y)+ yAo(x) forx,yER. 

Set y = | in (1.22) to get 


(1.23) 2A1(x) = Ao(x) + Ao(Lx 


1.2 Algebraic Conditions—Derivation 13 
and 2D 1(x) = D2(x) since A; (1) = A2(1). From (1.23) and (1.22), we have 
Az(xy) + A2(l)xy = xA2(y) + yA2(x); 


that is, Do(xy) = x D2(y) + yD2(x). Thus D2 and hence Dy are derivations. 
Case m = 1 andn > 2. Now (1.20) becomes 


(1.24) Aq(x") = x""! An(x). 
Changing x into x +r, with r any rational in (1.24), we obtain 
Ai(@ +r)") = @ +r)""TAa(e) + Aa()rI. 


Treating the equation above as a polynomial in r, equate the coefficients of r”~! and 
r”—? to get 


(1.25) nA,(x) = A2(x) + (n — 1)A2(1)x 
and 
(1.26) (5) 10%) = (n — 1)xAp(x) + (" ‘ ‘arte? 


respectively. Equation (1.25) gives nD, (x) = Do(x). 
Use of (1.25) in (1.26) yields 
nA (x7) = 2xAo(x) + (n — 2)A2(1)x? 
= 2x[nAi(x) — (n — 1)A2(1)x] + (n — 2)A2(1) x? 


or 
Aj(x7) — Ay (1)x* = 2x (Ai (x) — Ai (Ix); 


that is, D, (x?) = 2x D,(x), showing thereby that D, is a derivation. Hence D2 is 
also a derivation. 
Case m > 2 andn > m. So, n > 3. As before, replace x by x +r, r € Q, in (1.19) 
to have 

Ai(@ +r)") = @ +r)" "Ao(@ +1r)”). 


Now comparing the coefficients of r”~! and r”~? on both sides, we get 
(1.27) nA (x) = mA2(x) + (n — m)A2(1)x 

and 

(1.28) me Dave?) = mn) ante?) + (n — m)mx A(x) 


(n —m)(n —m— 1) 


Ap(1)x?. 
5 2(1)x 
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Now (1.27) gives the nD, (x) = mD2(x) we seek. To prove that D; is a derivation, 
use (1.27) in (1.28) to have 


n(n — 1)Aq (x7) = (m — 1)[n Aq (x) — (n = mA (1x7) 
+ 2(n — m)x[nA,(x) — (n —m)A\(1)x] 
+ (n—m)(n —m — 1)Aq(1)x?; 
that is, 
Ai (x?) — Ai (Dx? = 2x(Ar(@x) — Ai (Dx), 


showing thereby by (iii) of Proposition 1.9 that Dj is a derivation. So, D2 is also a 
derivation. 


Case n, m > Oandn <™m can be dealt with similarly by considering A2(x”) = 
x™—"Aq(x”). 
Case n, m < 0 follows from the case above by replacing x by + in (1.20). 


Finally, we consider the case n > 0 and m <0. We now treat n > O and 
m = —1; that is, 


Xx 


(1.29) Avs") =a"*"a2 (2), x £0. 


Forr € Q, using (1.29), we get 
2rA : =A : A : : 
COIN SE gk J EN ae *Vear)’ 


2r Ai (xe? —17)") = +r)" A(@ — ry") — @ — rl AL(ae +)"). 


that is, 


Now, comparing the coefficient of r on both sides, we have 
Ay (x) = —nx"1 Ay (x7!) + 2 + 1x" Ai 0"); 


that is, Dy (x2”) = —nx"*! Dy (x”—!)4+(n41)x" D(x"), which is (1.18). By Lemma 1.17, 
Dy, is a derivation. Equation (1.29) can be rewritten as 


Dy(x") + Ai(1)x” = x"*} E (-) + A2(1) (-)| : 


from which, by using Dj as a derivation, A2(1) = Aj (1), and replacing x by + gives 
nD\(x) = —Do(x). Hence Dz is also a derivation. : 

For this case and the general casen > 0, m < 0, see also [486, 558, 327]. This 
proves the theorem. 


Corollary 1.19. Halperin’sroblem (1.8) and Proposition 1.15 follow from (1.29) and 
(1.20) with Az = Aj. 
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We will consider another special case of (1.17) and prove the following proposi- 
tion. 


Theorem 1.20. (Kannappan and Kurepa [486], Kuczma [558]). If Aj : R > R 
(i = 1, 2) satisfy (A) and 


(1.30) A(t) = —t7A2 (-) , t(#0ER, 


then Dj(t) = Aj(t) — Aj()t, (J = 1, 2) are derivations, D; = D2, and A, — A2 
is continuous. 


Proof. For r € Q, from 


2rA : =A ! A : 
wae x2 — 72 OTe =f 2 x+r 


and (1.30), we have 


2r Aq (x? =f") = (x +r)Ai(x —r)-(x —ry A(x +r). 


By collecting the coefficient of r, we get A(x) = 2x A(x) — Ai (1)x?; that is, D, 
is a derivation. Note that A;(1) = —A2(1), and (1.30) can be rewritten as 


1 1 
Di(x) + Ai (1)x = —x? E (-) + ant) | 
or 
2 1 
Do(x) = —x" Dy = Dj (x). 
Hence Dz is also a derivation. 


Now Aj(x) — A2(x) = Di (x) + Ai ()x — Dox) — A2C)x = 2A1(1)x. This 
proves the proposition. 


1.3 Additive Equation on C, R”, and Ry 
1.3.1 Additive Equation on Complex Numbers 
Theorem 1.21. The general solution A : C — C of (A) is given by 


(1.31) A(z) = Ai(x) + iA2(x) + A3(y) +7 Aa(y), 


where z =x +iy, x,y € R, and A; : R > Rare solutions of (A) (j = 1, 2,3, 4). 


16 1 Basic Equations: Cauchy and Pexider Equations 

Proof. Let z =x +iy, w=u+iv, x,y,u,v € R. Further, let 

(1.32) A(z) = Bix, y) +i Bo, y). 

Then, since A satisfies (A), it is easy to see from (1.32) that 

(1.33) Bj(x +u,y+v) = B;(x,y)+ Bj(u,v), jf =1,2. 

Set y = 0, vo = 0 in (1.33). Then, with 

(1.34) Aj(x) = Bj(x,0), xe RG =1,2), 

we see that A; (j = 1, 2) satisfy (A) on R. Similarly, we obtain by defining 

(1.35) A3(y) = Bi, y), Aa(y) = B20, y),  fory eR, 

that A3 and Ay are solutions of (A) on R. From (1.23), (1.34), and (1.35), we have 
Bi (x, v0) = By (x, 0) + Bi (0, v) = Aix) + A3(0) 


and 


Bo(u, y) = Bo(u, 0) + Bo(0, y) = A2(u) + Ag(y). 


Hence (1.31) holds. This completes the proof of this theorem. 


Corollary 1.22. A complex-valued function of a real variable A : R > Cisa 
solution of (A) if and only if 


(1.36) A(x) = Ai(x) +iA2(x), x ER, 


where A; : R > Rare solutions of (A) (j = 1,2). 


Theorem 1.23. The general continuous solution A : C — C of (A) is given by 
(1.37) A(z) =cz+ dz, 


where c and d are arbitrary complex constants. The same conclusion holds when A 
is either continuous at a point or measurable. 


Proof. From Theorem 1.21, we see that A; (j = 1, 2,3, 4) are continuous, continu- 
ous at a point, or measurable on R. Hence, by Theorem 1.2, A; (x) = cjx, forx € R, 
and c; (j = | to 4) are real constants. Hence A(z) = cix +icox + c3y +icay = 
cz + dz for complex constants c and d. This proves the theorem. 


Remark 1.23a. The general differentiable solution of (A) where A : C > C is 
given by A(z) = cz, where c is a complex number, since Z is not differentiable. 
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Theorem 1.24. The most general solution A : R" — R of (A) has the form 
(1.38) A(x1,%2,°++ 5 Xn) = A1(x1) + Ao(x2) +--+ + An Gn), 


where Ax : R > R satisfies (A) fork = 1 ton. If A: R" —> C satisfies (A), then A 
is given by 


(1.39) A(X +++ Xn) = >) (Ani x) + iAno(en)), 
k=1 


where Axi, Ax2: R > R (k = 1 ton) are solutions of (A). Further, the continuous 
solutions are given by 


A(X1,%2,°°+ 5 Xn) = Cx, +02x2 +--+ + enXn, 


where cx’s are reals when A is real valued or are complex constants when A is 
complex valued. 


Proof. It is similar to the proof of Theorem 1.21 and follows from Corollary 1.22 
and Theorem 1.2. 


Theorem 1.25. [fa function A : R4 — R satisfies (A) and is (i) continuous at a 
point, (ii) monotonically increasing, (111) nonnegative for small x, (iv) locally inte- 
grable, (v) measurable, (vi) bounded in an interval, or (vii) bounded on a bounded 
set of positive measure, then A(x) = cx, for x € R, where c is an arbitrary constant. 


Proof. The proof is similar to proofs of Theorem 1.1 and Theorem 1.2. Later we will 
look at it from the point of view of extension of A. 


1.3.2 More Algebraic Conditions 


Theorem 1.26. (Kurepa [590], Vrbova [823]). Additive functions Aj, Az: C> C 
satisfying (A) and 


(1.40) (@) Are) =1PAi (2), ) Are) =-Kel*Aa(*), cee 40, 
are continuous and are given by 
(1.41) A,(z) = Aj (1)Re z, A2(z) = Ao(i)Im z. 
Proof. 
Define 
d\(z) = Ai(z) — Ai (1)Re z 
and 


d2(z) = A2(z) _ A2(i)Im z forzeC. 
Note that d; (1) = 0 and d2(i) = 0, and d, and dp satisfy (A). 
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From (A) and (1.40) for z 4 —1, 
d\(z) =d\(z +1) = A(z + 1) — Ai ()Re (z +1) 


1 
— I+ 1\7Aq (—) — A,(1)Re (z+ 1) 


ra 
=|z¢+1]°Ay (1 = —) — A, (1)Re (z + 1) 


Zz 


=—|z+1)?Ay (= :) + (Rez + |zI”)A1(1) 


ztl 
=-|z\*Ay (=) (Re z + [z|")A1(1) 


1 
== (ica: (-) — A;(1)Re :) = —d\(z). 


Hence d;(z) = 0 for all z, including 0 and —1, and A;(z) = A,(1)Re z forall z € C, 
which is the first part of (1.41). 
To obtain the second part of (1.41), now, by (A) and (1.40)(b) for z 4 0, 


do(z) = A2(z) — A2(i)Im z = —|z|7A2 (-) — Ao(i)Im z 


= —|z| (42 (-) — Ao(i)Im (-)) = —|z|"dp (-) ,2#0. 


From (A) and (1.40), since d2(i) = 0, we have 


1 
iO206-s 2-4 (—) ate, (: fe 
Lok Le 


. zi . z—i 1 
= |z—i|?dy (=) = —|zi|"d2 (=) = |z|"do (-) 
y ae | vas vb 


= —d2(z), implying that d2(z) = 0 


for all, z including z = 0; that is, A2(z) = A2(i) Im z. 

Equation (1.40)(b) was used in [823] in connection with the characterization of 
complex inner product spaces. Now we present another proof of the same. 

With z =iy, y € R, (1.40)(b) yields 


i 
An(iy) = y7A2 (-) » y(AOER. 
Forr € Q, yEeR, 


AsGotO = OS0r A (—) 
y+r 


that is, 


1 
Ao(iy) + Ao(ir) = (* + 2yr +r°)A2 ( ) 
yor 
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or 
yaa (=~ ) + Aa ir ) avr ) 
y oyrr yr ytr 
or 
24a (=) + Aa ( 2 ) =2va ) (cancelling r) 
yy +r) y+r ver 
or 


Ao(i(y tr)y) + y7A2 (eo) = 2yA2(i(y +r)) (canceling — 
bd (y+r) 


or 
i ; ; 
Aa(iy*) + rAaliy) + y"Aali) + ry" Aa (<) = 2y[Aaliy) +r Ao]. 
Equating the coefficient of r, we get 
A2(iy) = A2(i)y forally ER. 


Now we will prove that A2(x) = 0 forx € R. 


Ao(z) = Ao(x) + Aaliy) = —(x?2 + y2)Aa (=) (using (1.40)(b)); 


that is, 


x2 + y? 


= ) (using (1.40)(b)). 


Aa(x) = —(x? + y*)A2 (=) = Pal 


Now y = 0 gives Ax(x) = x*A2(x). Hence A2(x) = 0 forx # +1. But 
A2(+1) = 0. So, A2(x) = 0 for x € R. This proves the second part of (1.41) 
and the proposition. 

We now prove the following proposition, which is a generalization of Theo- 
rem 1.26. 


Theorem 1.27. (Kurepa [590], Kannappan [458]). The general solutions A, A2 : 
C > C satisfying (A) and 


(1.42) Ai(z) = —lel?A2 (<). Xe, 


are continuous and have the forms 
(1.43) Ai(z) =aRez+bImz, A2(z) = —aRez+Imz, forzeC, 


where a = Ai(1) = —A2(1) and b = A,(i) = A2(i) are complex constants. 
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Proof. Defining dj(z) = Aj(z) — Aj(1)Re z, (j = 1,2), z € C, and using (A), 
(1.42), and Ay(1) = —A2(1), we obtain 


d\(z) = Ai(z) — Ai (1)Rez = —|z|7A2 (<) — Aj(1)Re z 


(1.44) = —|z|*(A2 (-) — A2(1)Re (-) = —|z|7d2 (-) » 20. 
Zz Zz Zz 


Observing that d; and d2 are additive and d;(1) = 0 = d2(1), we get 


di(z) = d\(¢— 1) = —|z— 1]? (—) Slee i as (= = i) 


z—l 


Zz z—l 1 
(1.45) = -|z-1/*d (=) = (2/7 (=) = -|z\"dy (<) : 
z—-l1 Zz Z 


which is (1.40). Hence, by (1.41), di(z) = di(i)Im z; that is, Ay(z) = di(z) + 
A,(1)Re z = A, (i)Im z + A; (1)Re z. 

Further, (1.44) and (1.45) yield Dj = D2 and A2(z) = Di (i)Im z+A2(1)Re z = 
A2(i)Im z — Ai (1)Re z. Thus (1.43) holds. This completes the proof of this propo- 
sition. 


1.4 Alternative Equations, Restricted Domains, and Conditional 
Cauchy Equations 


Authors working in these areas are numerous, and the literature is rather volumi- 
nous (refer to Fischer [272, 278], Forti [284], Ger [304, 305], Dhombres [218, 221], 
Kannappan and Kuczma [485], Swiatak and Hosszut [786], and Vincze [819], which 
is only a partial list). When studying functional equations, one clearly notices that 
their solution depends not only on the functional equation and the class to which 
it belongs (like measurable or continuous functions) but also the domain of defini- 
tion of the function (for example, the equation f(xy) = f(x) + f(y) has solution 
f(x) = clog x on positive reals for measurable f, whereas on nonnegative reals or 
on R the only solution is f(x) = 0). Traditionally, authors used to suppose that the 
equation holds for all values of the variables on a certain set that appears normal for 
the equation. The classical example cited is the equation (A) holding on all of R. 
But it is found that in many situations the equation is either postulated on a certain 
subset (for example, (A) on nonnegative reals in economics) or is satisfied on some 
subset of the domain of definition of the function, leading to the study of functional 
equations on restricted domains. We will later study in some detail the extensions of 
functions. Some of the equations come from non-Euclidean relativity theory. Here 
we will deal with some of the problems above. 
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1.4.1 Alternative Equations 
A function A : R — R that is additive also obviously satisfies 
(1.46) [A(x + y)P = [A@) + AQ)P 


for x, y € R. The converse implication is less obvious. Several authors have studied 
the problem of the equivalence of (A) and (1.46). The equation (1.46) yields either 
A(x + y) = A(x) + A(y) or A& + y) = —A(x) — AQ), earning it the name 
alternative functional equation. We will prove in the next theorem the equivalence 
of (A) and (1.46) (see [819, 272]). Equations (1.46) and (1.50) are special cases of 
the equation 


[f@ +y)]" =[f@) + fo!" 
which is equivalent to (A) ([819]). 


Theorem 1.28. A nonzero function A : R — R that satisfies (1.46) also satisfies 
(A); that is, (A) and (1.46) are equivalent. 


Remark. The equation (1.46) is equivalent to the conditional Cauchy (additive) 
equation (A) for all x, y € R such that the following holds: A(x + y) + A(x) + 


A(y) #9. 

Proof. (Vincze [819]). For x, y, z in R, utilizing (1.46), we have 

A(x + y)+AQ@)P = [A@ + y)P+IA@P+2AG + y)AC) 
= [A@P + [AO)P + [A@P + 2A@)AG) + 24G + y)AG) 
=[A(Qx)+ AQt+2)/ 

= [A@)? + [AO)P + [A@P + 2AQ)AG) + 2A + ZAG); 


that is, A(x)[A(y + z) — A(y)] = A(zZ)[AG@ + y) — A(y)]. Since A ¥ 0, there is a 
zo € Rsuch that A(zo) 4 0. With z = zo, the equation above can be rewritten as 


[Awtyt+z/= 


[reen l rntiill een len 


(1.47) A(x +y)=A(y)+AQ@)k(y) forx,y €R, 
where k(y) = Ar) AQ) 
Interchanging x aad y in (1.47), we get 


(1.48) Atx+y)=AQ)+AQ)k@) forx,y ER. 


We will now show that k(y) = 1 for all y in R, so that (1.47) becomes (A). Indeed, 
by considering A(x + y+z) as A[x+(y+z)] and A[(x +y) +z] (using associativity) 
and using (1.47), we obtain A(z)k(y +x) = A(z)k(y)k(x). Since A 4 0, k satisfies 
the functional equation 


(1.49) k(y+x)=kQ)k@) forx,y eR. 


If k = 0, then (1.47) gives that A = constant, and then by (1.46) we conclude that 
A = 0, which is not the case. Then k(x) > 0 for all x € R. Letting y = x in (1.46) 
and (1.47), we have 
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[AQx)P =4[A(x)P and A(x) = A(x)(1+k(x)). 


Squaring the latter and noting that k(x) > 0, we get k(x) = 1 when A(x) 4 O. 
Suppose A(y) = 0. Use (1.47) and (1.48) to get A(x + y) = A(x)k(y) = A(x) (that 
is, k(y) = 1) since A 4 0. Hence k = | and A satisfies (A). 

The same conclusion can also be obtained by the following argument. First 
x = 0, y = 0 in (1.46) yields A(O) = 0, and then y = —x again in (1.46) gives 
A(—x) = —A(x); that is, A is odd. Equating the right sides of (1.47) and (1.48) 
results in A(y)[kK(x) — 1] = A@)[KGQ) — 1]. 

If k(y) = 1 for all y in R, there is nothing to prove. If not, k(x) — 1 = kj A(x) 
for ky anonzero constant. Then (1.46) becomes 


A(x + y) = AQ) + AQ) +k A)A(Y). 


Setting y = —x, the equation above yields k[A(x)]? = 0 (that is, A(x) = 0) since 
k, 4 0, which is again a contradiction. This proves the theorem. 


Theorem 1.29. Let f : R > R satisfy the functional equation 


(1.50) If@+yl=If@)+fO)| forx,yeER. 


Then f satisfies (A); that is, (A) and (1.50) are equivalent. 


Proof. First set x = 0, y = 0 in (1.50) to get f(0) = O and then y = —x in (1.50) 
to obtain f(—x) = — f(x) forx € R; thatis, f is odd. 
Further, for x, y € R, by using (1.50), we have 


(1.51) If@+y)—fO=lf@+y)+ F-x)| = IFO). 


For a,b € R, it is true that |a + b|? + la — b|? = 2a? + 2b. First take a = 


fat+y), b= f@)— fy) and thena = fa +y)— f(x), b = f(y), and use 
(1.50) and (1.51) to have 


Ife +ty+f@)-fOP+I1FG+y) -—f@)+ FOP 


(1.52) = 2 f(« + yl? +21f&) — FOI 
=2|f@)+ FO)? +21f@ — FO 
= 4 f(a)? +41f OP 

and 


If@ty)—f@)+ FOP +1F@+y) — Fe) -— FO)? 
=21f@+y)— Ff)? + 2F OP 
=21fO)? + 21f OY? 
=4/f()/. 
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Interchange x and y in the equation above, add the resultant to it, and use (1.52) to 
obtain | f(x + y)— f(x)— f(y)? = 0, so that f is additive. This proves the theorem. 


Remark 1.29a. Observe that no regularity assumption is made either in Theo- 
rem 1.28 or Theorem 1.29. As a matter of fact, the results above hold in a more 
general setting. 


Result 1.30. (Kuczma [558], Fischer [272], Fischer and Muszély [278]). Let (X, +) 
be a semigroup and let (Y,+,+) be a commutative ring. The functional equations 
(A) and (1.46) are equivalent in the class Y* if and only if Y contains no genuine 
nilpotent elements nor elements of order 3 in the group (Y, +). 


Result 1.31. (Fischer and Muszély [278], Dhombres [218]). In the class of functions 
f : S — H, where S is a commutative semigroup and H is a Hilbert space (either 
real or complex), the equation (A) is equivalent to 


f+ WI=NFC)+ FOI forallx,y eS. 


Here is a result along the same lines from complex variables due to H. Haruki 
[350]. Suppose f : C > Cis an entire function satisfying 


If +iy)| =|f@)+af(iy)|, x,y eR, ainC. 
Then the solutions are given by 
f(z) =aexp(bz) whena=0, z€C, 


or 
f(z) = az, asinbz, asinh(bz) 


when a = | for z € C (see also [706, 349, 348]), or 
f(z) = az+f27, asinz+fcosbz—f, asinh bz+fBcosh bz—f whena=-—1, 


or 
f(z) =0 when|1+a|41, f(z) = constant, 


when |! + a| = 1 in all other cases of a 4 0,1, —1 where b is real and a, f are 
complex constants. 


1.4.2 Restricted Domains and Conditional Cauchy Equations 


Frequently, in many situations, one encounters functional equations with a restricted 
domain when compared with what could be thought of as the natural domain. In the 
literature, “functional equations on restricted domains” (Kuczma [558]) and “condi- 
tional functional equations” (Dhombres and Ger [221]) have been used to describe 
this situation. We will return to this later in Sections 1.6.4, 1.7, etc. 

In the study of the measure of nonlocalized oriented angles, Dubikajtis and 
Kuczma [232] came across the functional equation f(x; + x2) = f(x1) + f(2) 
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for |x1| < oe |x2| < >, the Cauchy equation (A) on a restricted domain. 
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1.4.3 Mikusinski’s Functional Equation 


J. Mikusitiski asked for one-to-one, onto mappings (bijections) T : R* —> R? that 
map straight lines onto straight lines. The problem could be solved without the use 
of functional equations, and this is what was done by Mikusinski and S. Karnik. 
However, here we present the earlier version of the argument leading to what is now 
called Mikusinski’s functional equation (also refer to [558, 218, 44]). 

As T is one-to-one, it maps parallel lines into parallel lines and so transforms 
parallelograms into parallelograms. First of all, write T : R* — R* as T(x, y) = 
(F(x, y), G(x, y)) for x, y € R*. We will determine F and G, where F, G : R? > 
IR. We may assume without loss of generality that F (0,0) = 0 = G(0, 0). 

The parallelogram property applied to the points (0,0), (x, 0), (x, y), and (0, y) 
yields the system of functional equations 


(1.53) F(x, y) = F(x,0)+ FO, y), G@, y) = G@,0)+ GO, y). 


The collinearity of the points (0,0), (1,0), and (x, 0), x € R, x 40, results in 
the collinearity of T(0,0), T(1, 0), and T (x, 0), whence 


(1.54) aF(x,0)=bG(x,0) forx ER, 


where the constants a = G(1,0) and b = F(1,0) are not zero simultaneously; we 
may assume b # 0. 
In the same way, the collinearity of (0, 0), (0, 1), and (0, y) leads to 


(1.55) cF(0,y)=dG(0,y), fory €R, withd £0. 
From (1.53), (1.54), and (1.55), we obtain 
(1.56) d[bG(x, y) —aF(x, y)] = (bc —ad)F(O,y) forx,y ER. 


In view of (1.56), bc — ad 4 0, for otherwise we would have bG(x, y) = 
aF (x,y), which implies that the image 7 (R7) is contained in the line y = Bt 
contradicting the assumption that T is onto. 

As x varies in R, the line (x, x) maps to the line (F(x, x), G(x, x)); that is, 


(1.57) aF(x,x)+fPG(ax,x)=0, with ja|+ |f| 40. 
Combining (1.53), (1.54), (1.55), and (1.57), we get 
(ab + Ba) F(x, 0) + (ad + Bc)F(0,x) =0, 


where |ab + fa| + jad + Bc| 4 0 (otherwise bc — ad = 0). If ab + fa = 0, 
then F (0, x) = 0, which by (1.56) contradicts the bijectivity of T. Similarly, ad + 
fc = 0 would imply F(x, 0) = 0, which due to (1.54) gives G(x, 0) = 0, again 
contradicting that T is one-to-one. Hence, we have 


(1.58) F(0,x) =kF(x,0), forx €R, withk £0. 


Define f(x) = kF(x,0), x € R. The problem boils down to finding f. 
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Finally, from the collinearity of (x + y,0), (x, y), and (0, x + y), and by (1.53), 
(1.54), (1.55), and (1.58), we obtain 


F(x + y,0) Ga +y,0) 1 
F(x, ¥) G(x,y) 1) =0; 
FO,x+y) GO,x+y) 1 


that is, 


f(x+y) sity) 1 
FR)+KO) £f@) + ¥ f(y) 1] =0, 
kf(ety) “f@+y) 1 


the equation 


(1.59) f@+yMIfe+y)—- fO)—-fO)=0 forx,yeR. 


This equation is a conditional Cauchy equation known as Mikusiriski’s equation. It is 
obvious that (A) implies (1.59). But it is less obvious that in fact (1.59) implies (A) 
without any regularity condition. It is still more interesting that this conclusion fails 
to hold if we restrict the domain of f to the set of integers Z. The equation (1.59) is 
equivalent to the condition 


(1.59a) if f(x + y) £0, then f(x + y) = f(x) + fO). 
We now proceed to determine the solution of (1.59) in a general setup. 


Result 1.32. (Dhombres [218], Kuczma [558], Ger [304, 305], Dhombres and Ger 
[221]). Let X and Y be groups (not necessarily Abelian). If the group X has no 
(normal) subgroup of index 2, then the conditional equation (1.59) for functions 
f :X— Y is equivalent to (A). If X has a normal subgroup K of index 2, then the 
family of all solutions f :X — Y of (1.59) consists of all solutions of (A) and all 
solutions 


0, xeEkK 


(1.59b) roy=|" eek 


where c is an arbitrary constant. 


Remark. It is evident that if X = R = Y, then (1.59) or (1.59a) and (A) are 
equivalent, and that if X = Z = Y, the class of solutions of (1.59a) consists of the 
linear functions f(x) = cx (c an integer) and all functions of the form f (x) = 0 for 
x €evenand f(x) =c for x € odd, where c ¥ 0 is an integer. 


We now return to the Mikusinski problem. 


Theorem 1.33. Let T : R* — R? be a bijection mapping straight lines into straight 
lines and continuous at a point. Then T is linear; that is, T(x, y) = (ax + fy, yx+ 
dy), and a, fh, y, 0 are constants. 
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Proof. From the arguments above, Result 1.32, and Theorem 1.2, we can conclude 
that f(x) = F(x, 0) is additive and f(x) = ax for some constant a. Now, from 
(1.53), (1.54), (1.55), and (1.58), we have F(x, y) = ax +kay, G(x, y) = Skax + 
¢ay, and the conclusion holds. 


The conditional functional equation 


(1.59c) f(x) + f(y) 40 implies f(x + y) = f(*) + fy) 
symmetric to (1.59a) was treated by Dhombres and Ger [218, 221]. 


Result 1.34. Let X, Y be groups (not necessarily Abelian), and suppose that Y does 
not contain elements of order 2. Then a function f : X — Y satisfies (1.59b) if, and 
only if, it satisfies (A). 


The functional equation 


(1.60) [f@ +y)—af (x) — bf IF @ + y) — F@) — fO)] =9, 


where a, b are constants, is a joint generalization of (1.46) and (1.59). The following 
result was proved by Kannappan and Kuczma [485]. 


Result 1.35. Let G(+) be a commutative group and H (+, -) be an integral domain 
of characteristic zero. Let f : G — H be a solution of (1.60). Then K = {x € 
G| f (x) = 0} @fnot empty ) is a subgroup of G and only the following four possibili- 
ties can occur: 


Gj) a+b=1, K = ¢, and f is constant. 
(Gi) a+b=0, K is of index 2, and f has the form given by (1.59b). 
(iii) a+b = -1, K is ofindex3, and f has the form f (x) =Oforx € K, f(x)=c 
for x €xo+K, and f(x) = —c forx € —x9 + K, where xo ¢ K. 
(iv) a, bare arbitrary constants, f is additive (that is, f satisfies (A)), and the index 
of K is either | (in which case f = 0) or infinite. 


1.5 The Other Cauchy Equations 


Let us treat the functional equations 


(E) E(x+y)= E(Qx*)EQ) (exponential), 
(L) Lay) = L@)+LQ) (logarithmic), 
(M) M(xy) = M(x)M(y) (multiplicative), 


also known as Cauchy’s equations. In order to distinguish one from the other, we 
will identify them as additive, exponential, logarithmic, or multiplicative equations. 
All of them can also be thought of as homomorphisms between semigroups with 
appropriate algebraic structures. 
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For example, let S and H be semigroups. Then (A) is a homomorphism between 
S(+) and H (+), (E) is between S(+) and H(-), (L) is between S(-) and H(+), and 
(M) is between S(-) and H(-). 

One can find solutions of these equations either by a method similar to that em- 
ployed for solving (A) or by other means. But one can also find the solutions more 
promptly by putting them in a form analogous to that of (A). First let us consider 


(BE). 


1.5.1 Exponential Equation 


Suppose E satisfies (E) for x, y € T, where either T = R, T = R4, or T = Ry}. 
In the first two cases (that is, for all real or all positive x, y), we will prove that a 
solution of (E) is either identically zero or nowhere zero. 

Indeed, suppose E (xo) = 0 for some xo. Then (E) yields 


(1.61) E(x) = E(x — x0): E(x). 


If T = R, (1.61) shows that E(x) = 0 for all real x € R; that is, everywhere. 
If case (E) holds only for positive x, y, (1.61) shows that E(x) = 0 for x > xo. Let 
x € ]0, xo[. Then there is an integer 7 such that nx > xo. Now, from (E) and (1.61), 
we have 0 = E(nx) = E(x)", and thus again E(x) = 0 forall x € R%. 

The argument above implies that E(x) = 0 for all x > O but not for x = 0. Asa 
matter of fact, E given by 


(1.62) E(x)=0, forx >0, E@)=1, 


is a solution of (E) on the domain T = R,. 

As E(x) = 0 is a solution of (E) in all these cases, without loss of generality we 
can assume that E(x) 4 0 for all x € T; that is, E is nowhere zero. 

Replacing x and y in (E) by 5, we obtain 


EG@)=E (Gy > 0, 


from which it follows that any nontrivial solution of (E) is always positive. Now 
taking the logarithm on both sides of (E), (E) goes over to (A), where 


(1.63) A(x) =log E(x) and E(x) = e4@), 
Thus we have proved the following theorem. 


Theorem 1.36. Let E : T — R satisfy (E). Then the most general solutions of 
(E) are given by E(x) = 0, and by (1.63), E(x) = eA), where A is an arbitrary 
solution of (A) on the same domains. If T = R, we have the additional solution 
(1.62), but no others. 
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Corollary 1.36a. The most general continuous (continuous at a point, measurable, 
measurable on a set of positive measure, strictly monotonic on x, y > a > O, etc.) 
solutions of (E) are given by 


(1.64) E(x)=0 and E(x)=e", 


where c is a constant on the domain T = R or R%.. If (E) holds on T = Ry, 
continuous solutions are still given by (1.64), but in the case of measurability by the 
solution (1.62) also. 


The proof follows from Theorem 1.36 and Theorem 1.2. 


Theorem 1.37. Suppose v : R4 — C is a nontrivial complex-valued function and 
satisfies (E) with v(0) = 1 and |v(x)| is bounded in some interval [a,b]. Then 
|v(x)| = exp(cx) for some real constant c. 


Proof. (Kannappan [426]). Define A(x) = log |v(x)| for x € Ry. Then A is well 
defined and satisfies (A) with A(0) = 0 and A is bounded from above on [a, b]. 
Then, by Theorem 1.2, A(x) = cx for x > 0. This proves the theorem. 


Definition: Character. Set y(x) = al for x € Ri, where v satisfies (E) with 


v(0) = 1. Then it is clear that x also satisfies (E) for x, y > 0 and |y(x)| = 1. For 
negative x, we may set y(x) = [y (=) Then y satisfies (E) for all real x. Such 
a function is called a character ([369, 376]) of the real line. 


Theorem 1.38. (Hille and Phillips [376]). A mapping x : R — C is a measurable 
character of the real line if and only if x(x) = e!”* for some real constant b. 


Proof. For |6| > 0 anda > 0, we have 


[x(x +0) — x(x)Ja = : Ixy(x+d—y)— x(x —- y)]xQ)dy. 


Hence i Hi 
+9) xe) s =f [gee 3-9) x0 = y)ldy, 
0 
which tends to zero with |6|. Thus x is continuous. Further, 


x(6)-1 


a 1 a 
‘ [ xeoar = | pe 7Old 
0 0 


1 ato 1 a 
= al U(x)dx — =f Y(x)dx. 


Choose a such that Ie y(x)dx # 0. Since the limit as |6| — 0 exists for terms on 


the right-hand side of this equation, it follows that the derivative of v at x = 0 exists, 


and let dx) |,-9 = #. Since y satisfies (E), we can conclude that the derivative of 


x exists for all x € R and that ax) = fy (x). In fact, since 


x(6)-1 


xe +0) — x(x) _ 
a +9) — x) _ me) 


5 x(x) 
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taking the limit as |6| — 0, the result follows. Hence x(x) = ce’*. Since 
x(0) = 1, c = 1. As |y(x)| = 1, we see that f# is purely imaginary; that is, 
f£ = ib for some real b. 


Corollary 1.39. [fv : R~ — C is measurable and satisfies (E) nontrivially with 
v(0) = 1, then 
v(x) = e@t')* for some reals a, B. 


Proof. Define A(x) = log |v(x)|. In the first place, A is measurable and additive on 


R;. Hence A is continuous, and A(x) = ax for some real a. Thus |v(x)| = e**. 
Further, 7(x) = Tra is measurable. So, by Theorem 1.38, y (x) = e'4* for some 


real /. The result is now immediate. 


Result 1.40. (Aczél and Dhombres [44]). The general solutions E : R — C of (E) 
are given by 


E(x)=0 and E(x) = eA@tiA2@), 


where A: R — R is additive and A2 is an arbitrary solution of 
(1.65) Az(x + y) = Ao(x) + Ao(y) (mod 2z) forallx,y ER. 


Finally, we have the following result. 


Result 1.41. [44]. The general solutions E : C — C of (E) are given by E(z) = 0 
and E(z) = eA1@)+A2(y)+iA3()+iA4()), where A, and A2 satisfy (A) on R and A3 
and Ag satisfy (1.65) on R. Further, if E is measurable, then E(z) = O or E(z) = 
e%+B2, where a and B are complex numbers. 


1.5.2 Logarithmic Equation 


Theorem 1.42. /f L : R* or Ri. > Ris a solution of (L), then the most general 
form of L is given by 


(1.66) L(x) = A(log |x|), for x € R* or L(x) = A(logx), x € Ri, 


where A satisfies (A) on R* or R*.. 


Proof. First notice that L is even when (L) holds on R* because, with y = x, (L) 
gives L(x*) = 2L(x), which when x is replaced by —x yields L(x*) = 2L(—x), 
showing thereby that L is even; that is, L(—x) = L(x). 

Now let x and y be positive. We will show that (L) can be reduced to (A). There 
exists u, v € R such that x = e“ and y = e”. By defining 


A(u) = L(e") foru eR, 


the equation (L) becomes A(u + v) = A(u) + A(v) foru,v € R; that is, L(x) = 
A(log x) for x € R}. 
The result on R* follows from the fact that L is even. 


Remark 1.42a. The only solution of (L) on R or Ry is L(x) = 0. 


30 1 Basic Equations: Cauchy and Pexider Equations 


Corollary 1.43. The general solution of (L) that is continuous, continuous at a 
point, measurable, or strictly monotonic on R*, is given by L(x) = clogx, and on 
R*, the solution of (L) that is continuous, continuous at a point, measurable, or 
bounded from one side on a set of positive measure is given by L(x) = clog |x| for 


x £0. 


The proof follows from Theorem 1.42 and Theorem 1.2. 

The domain of the logarithmic equation (L) can be restricted and can be seen 
from the following results. 

Suppose (L) holds for x, y € ]0, 1]. Setting x =e", y = e7” (u, v = O; that is, 
u,v € R,), (L) goes over to (A) on R, by the substitution A(u) = L(e~“). Thus 
we have the following result. 


Result 1.44. /f L :]0,1] — R satisfies (L), then L is given by L(x) = A(—logx) 
for x € ]0, 1], where A is additive on R. If L of (L) is continuous or measurable, 
then L(x) = clogx for a real c, and if L of (L) is nonnegative on ]0, 1], then 
L(x) = —c log x for a nonnegative constant c. 


Result 1.45. [44]. Let L : Z7. > R satisfy (L), that is, 
(1.67) L(mn) = L(m) + L(n) 
holds for all positive integers m and n with 


lim in 


n LG +1)—L(n)] => 0. 
Then there is a constant c such that L(n) = clogn, n € Zi. 
Result 1.45a. [43]. Suppose L : Z7, — R satisfies (1.67) with 
nL(n) <(n+1)L(14+1) forn=1,2,.... 


Then L(n) = c logn for some constant c > 0. 
The same conclusion holds when L satisfies (1.67) and lim [L(n + 1) — 
n—->Co 


L(n)] = 0. 
Result 1.46. [218, 43]. The general monotonic solution of 
L(xn) =L(x)+L), forx e[l,ool, ne Zi, 


is given by L(x) =clogx, for x € [1, cof, where c is an arbitrary real number. 


1.5.3 Characterization of Exponential and Logarithmic Functions 


The functions e* and log x can be characterized by means of the equations (E) and 


(L), respectively (see the theorems above Theorem 1.42 and Result 1.44). But these 
functions can also be characterized with the help of the equations 


(1.68) EQx)= EG), 
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(1.68a) LG") =20), 


in a single variable and some additional conditions. The following result will be of 
interest in that direction (see [550, 558]). 


Result 1.47. (Kuczma [550]). The function E(x) = e* is the only function that 
is either (i) differentiable in R and satisfies (1.68) with E(O0) = E'(0) = 1 or 
(ii) logarithmically convex in IR*. and satisfies (1.68) and the condition E(1) = e. 

The function L(x) = log x is the only function that is either (i) differentiable in 
[1, co[ and satisfies (1.68a) with L'(1) = 1 or (ii) concave in |1, co[ and satisfies 
(1.68a) and the condition L(e) = 1. 


There are other characterizations of the logarithmic function (see also 
Chapter 17). 


Theorem 1.48. (Heuvers [367]). Let f : Ri. > R satisfy the functional equation 


1 1 
(1.69) fis +y)~ Fo Fo) =F (5 +>) forx,y ER}. 
Then f satisfies (L) and conversely. 


Here we present a modified proof from [367]. 
Proof. If f satisfies (L), it is easy to check that f also satisfies (1.69). Now let f be 
a solution of (1.69). Define 


Fa,y)=f@+y)—f@)—fO) forx,y eR}. 
F is called a Cauchy difference and satisfies 
F(x+y,z)+ F(x, y) = Fy +z,x) + FQ, 2) 
(see Chapter 13). Now (1.69) yields 


1 1 1 1 1 1 1 1 
(1.70) f{—++-]4+/f{-+-]=f +—-)4+f(-+-), 
(— :) (; s) (= 7) (5 :) 


for x, y,z € R4. Note that f(1) = 0 and f (+) = —f (x). 
Set + a 4 = 1 in (1.69). Clearly y andz > landz= i Now 


1 1 1 y-1_ y*+xy—x 
x+y Zz x+y y yx + y) 
and 
1 1 yt+xy—x 
as =e 


Now (1.70) becomes 


ae ae (- ~) = yr t+xy—x 
r( y@+y) Jer aa ad xy? 
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for x > O and y > 1, which by the substitutions 


1 yy? +ay—x 
ol + = 0 SS 
x oy ya@+ty) 
can be written as 
(1.71) fluo) = flu) + f). 


Since x > Oand y > 1, uw and v are positive and there may be further restrictions on 


u and v. We have 
x 


~ yet+y) 


Then evidently 0 < v < 1 since x > 0, y > 1, and 


u=—-+-,0=1 
x y 


v= mr 
that is, 
1 
uv =u-—= 
y2 
or 
1 
—=  Ju(1—vd) 
y 
and 


1 _wW—(u(l—o)) | u(u+v—1) 
gOS ea a 0) wt Salo) 


Since 0 < v < 1landx > 0, u+v-—=1 > 0, which will be satisfied if u > 1. 
Thus (1.71) holds for u > 1, 0 < v < 1. Also, since 
fu-)= fu) = fu)t+ fQ, 
fo - D)= fe) = fe)+ fd), 
(1.71) holds for u > 1 andO <v <1. 


Now we will show that (1.71) holds for all u,v > 0. For u,v > 0, choose n 
sufficiently large that nu > 1 and = < 1. Now 


fu) =f (mu ~) = f(nu) + f (-). by (1.71), 


and 


fo) =F (2-0) =F (=) +f, again by 1.70), 
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so that 


1 
flu) + flo) = fan) + f (~) +f(-)+F@ 


= f (nu) +f (-) (sine f (-) 2 -40)) 


= f (mu: ~), by (1.71), 
= f(uv). 


Hence f satisfies (L) on R%,. This proves the theorem. 


Theorem 1.48a. Suppose f : Ri. — R satisfies either 


1 
(1.69a) f(xy) -— f@) =—-f (+). forx,y > 0, 
or 
1 1 
(1.69b) fay) =—-f (-) —f (-) , forx,y>0. 


Then f satisfies (L) on R*.. 
Proof. It is easy to see that f(1) = 0 in either case. Suppose f satisfies (1.69a). 
Then interchanging of x and y shows that f(x) + f (+) = c constant. Set x = | to 


getc = Oand f (+) = — f (x). Then (1.69a) is (L). As for (1.69b), let y = + to get 


f@+f (+) = 0. Thus f satisfies (L). This completes the proof. 


1.5.4 Multiplicative Equation 


The equation (M) is the most complicated of all the three equations. We obtain the 
solution of (M) in the following theorem. Let T be either R;, R4_, R, or R*. 


Theorem 1.49. The general solutions M : T — R of (M) are given by 
(1.72) M(x)=0 or M(x) =e48*),  forx ERY (fT =R4), 


eAlogx)) xe Ri 
0, x=0 (fT =R, 


—— 


(1.72a) M(x) =0, M(x) =1, M(x) = | 


(1.72b) 
M(x) =0, M(x) =1, M(x) = sign x, M(x) = |sign x| for T = R or R*, 


eAllog|xl) x € R* 
0, x=0 


sign xeA@oe lt), x € R* 
0, x=0 


> 


mey=| orm) ={ 


where A is a solution of (A) on R. 
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Further, M : T — R is a continuous (continuous at a point) solution of (M) if, 
and only if, either M = 0 or M = 1 or M has one of the forms 


M(x) = |x|°, M(x) = (sign x)|x|°, x ET, 


for some constant c. If 0 € T, thenc > 0. 


Proof. (Aczél [12], Kuczma [558], Kannappan [426], Aczél and Dhombres [44]). 
Let x,y € R‘. There exist u,o € R such that x = e“ and y = e?. Setting 
E(u) = M(e"), u € R, the equation (M) goes over to (E). Then, by Theorem 1.36, 
(1.72) follows. Thus the continuous solutions of (M) in this case are M(x) = 0 or 
M(x) =x°. 

Now put x = 0 in (M) to get M(0)(1 — M(y)) = 0; that is, either M(0) = 0 or 
M(y) = 1 for all y € Ror Rx. This proves (1.72a). 

Letting y = 1 be put into (M), we obtain either M(1) = 1 or M(x) = 0 for all 
x € T. Now we will consider T = R or R* with M(1) = 1. Setting x = -—1 = y 
in (M), we get 1 = M(1) = M(—1)’; that is, either M(—1) = 1 or M(—1) = --1. 
Finally, set y = sign x (x 4 0) in (M) to get 


M (|x|) = M(x)M(signx), forx 40, 
which is the same as 

M(x) = M(|x]), for x € R*, 
or 

M(x) = (sign x)M (|x|), for x € R*. 


Now, (1.72b) follows from (1.72) and (1.72a). Further, continuous solutions can 
be obtained from (1.72), (1.72a), (1.72b), and Theorem 1.2. 


1.6 Some Generalizations of the Cauchy Equations 


First we start with the Jensen and the Pexider equations. We study them in a general 
setting, too. One of the interesting and surprising aspects of functional equations is 
that one equation can determine many unknown functions. We will notice this in this 
section when dealing with the Pexider equations and several times in later sections, 
too. 


1.6.1 Jensen’s Equation 


The equation 


a : e =) - 12)+/0) 
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is known as Jensen’s equation (Jensen [394], Aczél [12], Aczél, Chang, and Ng 
[40]). Whereas (J) makes sense in algebraic systems that are 2-divisible (that is, 
division by 2 is permissible), by replacing x by x + y and y by x — y, (J) goes over 
to 


(i) fa +y)+ f@ —y) = 2f(), 


which in a way eliminates this problem and makes sense in algebraic systems that 
need not be 2-divisible. The equations (J) and (J;) are equivalent in 2-divisible sys- 
tems. Both equations can be solved by relating them to the additive equation (A). 
The equation (J) is meaningful in an interval (a, b) without any restriction, whereas 
(J) holds on an interval with restrictions. The equation (J) is “geometrical” in nature, 
which preserves midpoints. We will use either of them, depending on the situation. 

Let G(+) be a group not necessarily Abelian and H (+) be a 2-divisible Abelian 
group. Suppose f : G — H satisfies (J;) with 


(K) fatytz=f@+z+y) forx,y,z EG. 


Remark. We will encounter this (K) condition several times later on, too. 


As a consequence of (K), we obtain 


(1.73) fotz=ft+y) forz,y eG. 


Note that in general (1.73) does not imply (K). 
Let x = 0 in (J;) to get f(y) + f(—y) = 2f (0) for y € G. Interchanging x and 
y in (J,), adding the resultant to (J;), and using (1.73), we have 


2f% +y)+2f0)=2f@)+2f0), x,y EG. 


Defining 
A(x) = f(x) — fF), 
we see from the equation above that A satisfies (A) since H is 2-divisible. Thus we 
have proved the following theorem. 


Theorem 1.50. (Aczél, Chung, and Ng [40]). The general solution f : G > H of 
(J1) satisfying (K) is given by 

(1.74) f@)=A@)+c, x EG, 

where A satisfies (A) and c is an arbitrary constant. 


Corollary 1.51. Let f : T > C, where T = R or Cis a solution of either (J) 
or (J). Then the general solution is given by (1.74) forx € T, where A:T > C 
satisfies (A) and c is a complex constant and the solution continuous or continuous 
at a point is given by 


f(x) =ax+c(whenT=R) and f(z)=az+bz+cwhenT =C, 


where a, b, c are complex constants. 
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The equation (J) is more meaningful on an “interval”. The following results will 
be of interest in this direction. 


Theorem 1.52. Let f be a real-valued function defined on an arbitrary interval and 
satisfy (J). If f is continuous, then f is affine; that is, 


(1.75) f(x) =cx+b 
for all x in the interval, where c and b are real constants. 


Proof. Notice that f maps midpoints to midpoints. 

Without loss of generality, let us assume the interval is [0, 1]. Let f(0) = b and 
fC) =a. For every x, y in [0, 1], it is evident that a € [0, 1]. The proof is based 
on induction. 

First, let us show that (1.75) holds for x, a dyadic number in [0, 1]. Then, since 
the dyadic numbers are dense in [0, 1] and f is continuous, we will have the required 
result (1.75) for all x € [0, 1]. From (J), we have 


Tyg fOr ll) fOr) bre.) 
2G as (oo eer ce 


where c = a — b. Suppose (1.75) holds for x. Then 


l O+1/2"\ b+b+ mc 1 
(=) = (FF ye —_7 =b+ anti 


t 


Now we will prove by induction that (1.75) holds for x = 3, 


such that 0 < t < 2”. 

Suppose (1.75) holds for x fork < mand x € [0, 1]. Now, to prove (1.75) for 
ae ifm = 2! the result follows from the induction hypothesis. Otherwise, we can 
write m = 2* +r such that 2+! > m. Notice that n — k > 1. Then 


(2) -)(=3=)- =e 


where ¢ is an integer 


Qn 2 g 


1 r 1 m 
PEN saat oat) 0 an 


which completes the proof. 


Remark 1.52a. Associativity is a useful concept. We will use it to solve Jensen and 
Pexider equations and at many other places in later chapters, too. 


Now we will consider f : G — R satisfying (J) with (K), where G is a 2- 
divisible group, and show that f has the form (1.74). 
Replacing y by y + z in (J), we have 
f(A) _ f@)+fO4+2) 


5 5 forx,y,z EG, 


_ f@+V+F@ 


5 (use associativity); 
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that is, 


fa@t+y)—f@)=f+z2)—-f@) forx,y,z€EG. 
So, for fixed y € G, we define h : G > R by 
h(x) = f(x +y)— fF). 


Then A is aconstant, say a function of y, g(y). Then 


fa@+y)=f)+e), forx,yeR, 
=f)+ 8) (by (kK), 

interchanging x and y, or 

f(x) —g(%) =d, aconstant, 
and 

f@+ty=f@)+ fO)-d. 
Define A : R > R by 

A(x) = f(x) —4, 

so that A is additive (A) and 


f(x) =A(x)+d_ forx ER, 
which is (1.74). 


A Generalization 
Use associativity again. 
Theorem 1.53. Suppose f, g,h:G— R satisfy 


f (- — _ 8) +hO) 


(1.76) ; 5 


forx,y €G, 
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with f satisfying (K), where G is a 2-divisible group. Then there exists an additive 


Aon G such that 
(1.76a) f(x) = AQ) + sash g(x) =A(x) +a, h(x) = A(x) +b, 


where a, b are real constants. 


Using only associativity twice, we can obtain from (1.76) 


(2) _ gt+y) +h@) 
2 ~ 2 
= g(x) + h(y +2) 


, forx,y,zEG; 
5) y 
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that is, 
g(x + y)— g(x) =h(y +z) — A(z) 
= a function of y 
= A(y) (say), forx,y,z EG, 
(1.76b) gx +y)=8()+AQ), 
(1.76c) h(y+z)=h@)+A(y), forx,y,z€G. 


Put x = 0 (identity of G) in (1.76b) to get 
g(y) =a+ Aly). 


Hence (1.76b) yields that A is additive. 
Set z = 0 in (1.76c) to have 


h(y) = A(y) +5, 


and these values of h, g in (1.76) yield f in (1.76a). (f and g can also be obtained 
by using associativity.) This proves the theorem. 


Remark. Notice that the argument above is valid when the domain is a non- 
commutative semigroup and the range is a commutative group. 


Remark. Note that one equation (1.76) determines three unknown functions f, g, h. 


Corollary 1.53a. Suppose f, g,h : R > R satisfy (1.76). The continuous solutions 
of (1.76) are given by 


b 
(1.76d) fee)sext¢ , g(x) =cx +a, h(x) =cx +b, 
forx ER. 
Result. (Kuczma [558]). Let C C R” be a convex set such that the interior of C 4 %, 
and let f : C > R bea solution of (J). Then f is given by 
fx) =AQa)+b forxec, 

where A: R” — R satisfies (A) and b, a constant. 

The proof of this result involves extensions of functions, which is the topic of 
discussion in the next section. 
Isometry 


Suppose EF and F are two normed linear spaces over R. A mapping f : E > F 
satisfying the equation 


FG) — FO = Ile — yI 


is called an isometry. 
Evidently f is one-to-one and continuous. We know from Theorem 1.50 that if 
f : E> F satisfies (J,) or (J), then f has the form (1.74). 
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Result 1.54. [44]. The general solution f : E — F of (1.76), which is onto and 
satisfies (J); that is, every isometry between two real normed linear spaces that is 
onto is an affine transformation (a mapping composed of a linear transformation 
and a translation). 


1.6.2 Pexider’s Equations 


The functional equations 


(PA) f@+y)=g8)+hAQy), 
(PE) f@+y)=gx)h(y), 
(PL) fy) = g@)+h(y), 
(PM) fy) = g@)h(y), 


treated by Pexider [666] with three unknown functions /, g, are known as Pexider’s 
equations, which are generalizations of the Cauchy equations (A), (E), (L), and (M), 
respectively. The solutions of these equations can be obtained by reducing them 
to the corresponding Cauchy equations and can be thought of as homomorphisms 
between two algebraic structures; the Pexider equations can be thought of as homo- 
topies between two algebraic systems. Before we treat (PA) to (PM) individually on 
various structures, we will unite them in the following result as a homotopism. 


Definition. Let Nj («) and N2(0) be groupoids (a set endowed with a binary opera- 
tion is called a groupoid). A triplet f, g,h : Nj — N2 is called a homotopy provided 
it satisfies the functional equation 


(1.77) f(xx*xy)=g(x)0h(y) ~forx,y eM. 


Theorem 1.55. Let G(«) be a groupoid with an identity e and H (0) be a group not 
necessarily commutative. The general system of solutions f, g,h : G — H of (1.77) 
is given by 


(1.78) f(x) = a0k(x)Ob, g(x) = a0k(x), A(x) = k(x)0b,  forx €G, 
where k is a homomorphism from G to H; that is, 
(1.79) k(x *y) =k(x)OK(y),  forx,y €M, 


and a and b are arbitrary constants in H. 


Proof. Letting x = e and y = e separately in (1.77), we have 


(1.80) f(y) = a0h(y), f(x) = g(x)0b, x,y EG, 


respectively, where a = g(e) and b = h(e) are in H. From (1.77) and (1.80) there 
results 
f(xy) = f(x)0b“'0a~'0f(y) forx,y € NM. 
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By defining 
(1.81) k(x) =a '0f(x)0b-!  forx EG, 
the equation (1.81) reduces to (1.79). Thus (1.81) and (1.80) give the result (1.78) 


we seek. 
From Theorem 1.55 there results the following theorem regarding the Pexider 


equations (PA) to (PE). 
Result 1.56. (a) The general system of solutions f, g,h : G(+) — H(+) of (PA), 


(b 


wm 


where G(+) is a groupoid with identity and H (+) is a group, is given by 
f(x) =a+ A(x) +b, g(x) =a+ AQ), h(x) = A) +d, 


where A is additive; that is, A satisfies (A) and a, b are arbitrary constants. 
When H = G = Rand any one of f, g,h, say f, is continuous at a point 
(or continuous or measurable, etc.), the solutions of (PA) are given by 


f(x) =cx+at+hb, g(x) =cx+a, h(x) =cx +b, 


where a, b, c are real constants, and when H = R”, G = R", the solutions of 
(PA) with continuous f are given by 


f@x)=c-x+atb, g(x)=c-xt+a, h(x)=c-x+b, x ER’, 


where a, b are constants in R™ and c is anm x n matrix over R. 
The most general system of solutions f, g,h : G(+) > H(-) of (PE), where G 
is a groupoid with a neutral element and H is a group, is of the form 


f(x) =aE(x)b, g(x) =aE(x), h(x) = E(x)b, 
where E satisfies (E) and a, b are constants, together with the trivial solutions 
f(x) = 0, g(x) is arbitrary, h(x) = Oand f (x) = 0 = g(x), h(x) is arbitrary. 


When G = R(+) and H = R(.), the solutions of (PE) with a continuous f 
are given by 
f(x) = abe™, g(x) =ae™, h(x) = be™, 


where a, b, c are arbitrary nonzero constants, together with the two trivial solu- 
tions above. 

The most general system of solutions f, g,h : G(-) ~ H (+) of (PL), where G 
is a groupoid with identity and H is a group, is given by 


fx) =at+LQ) +b, g(x) =a+L(x), h(x) = Le) +b, 


where L is logarithmic; that is, L satisfies (L) on G and a, b are arbitrary con- 
stants. 

When G = Ri (-) or R*(-) and H = R(-+), the general solutions of (PL) 
with f continuous at a point are given by 

f(x) =clog|x|+a+b, g(x) =clog|x|-+a, h(x) =clog |x| +b, 


where a, b,c are arbitrary constants. 


1.6 Some Generalizations of the Cauchy Equations 41 


(d) Functions f,g,h : T — R, where T = R or R* or R%,, satisfy (PM) if and only 
if either they have the (trivial) form f = 0 = g, harbitrary or f =O =h, g 
arbitrary, or there exists a function M : T — R satisfying the multiplicative 
equation (M) such that 


f(x) = abM(x), g(x) = 4M(x), h(x) = bM(x) 


hold, where a, b are nonzero reals. 
If f, g,h are continuous solutions of (PM), then they have the trivial forms 


and 

f(x) = ab, g(x) =a, h(x) =b, 
or 

F(x) =ab|x|", g(x) =alx|", hA@) = dle’, 
or 


f(x) = ab|x|*sgnx, g(x) =a|x|°senx, h(x) = b|x|° sgnx, 
where a,b € R* andc € R (withc > Oif0 ET). 


In Result 1.56, identity in G plays a crucial role. Similar results hold even if 
G does not possess an identity element, but we supplement it by other requirements. 
For example, we prove the following result (refer to [558, 12, 40]) using associativity 
to solve (1.77) and then deduce the solutions of (PA) to (PM). 


Theorem 1.57. Let G(*) be a commutative semigroup and H(Q) be a group (not 
necessarily Abelian). Functions f, g,h:G — H satisfy (1.77) if and only if 


f(xx*xy) =aobok(xxy)obod, 
(1.82) g(x) =aok(x)ob, 
h(x) =bok(x)od, forx,yéG, 


where k satisfies 
(1.83) k(xxy)=k(x)ok(y) forx,yEeG 
(that is, k is a homomorphism). 


Proof. Using associativity of («) and (1.77), we have 


f@ *y *z) = g(x * y) oh(z) 
= g(x)oh(yx*z), forx,y,z EG; 


that is, 


(1.84) g(x)! og(x ey) =A(y *z)oh(z)| =k(y) (say). 
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From 
g(x*y)=g(x)ok(y), forx,y €G, 


using associativity again, we get 
g(x #¥y #2) = B(x x y)ok(Z) = g(x) ok) ok) 
= g(x) ok(y *2z). 
Thus k satisfies (1.83). Using () commutative and (1.84), we get 
8(x) ok(y) = g(y) ok), k(y) oh) = k(Z) oh(Y) 
or 
(1.85) g(x) =aok(x)ob, h(z)=bok(z)od, 


where a = g(yo), b=k(yo)7!, d =h(yo). 
Note that b commutes with k(x) (use (1.83)). Now (1.77) gives, using (1.85), 


(1.86) faxxy)=aok(x)obobok(y)od=aobok(xxy)obod. 
Equations (1.85) and (1.86) yield (1.82). This proves the theorem. 
Corollary 1.57a. 


(i) Functions f,g,h : G(+) — H(+) (where G is a commutative semigroup and 
Ha group not necessarily Abelian) satisfy (PA) if and only if 


f(aty) =a+b+A(rt+y)+b+d, g(x) =a+A(x)+b, h(x) = b+A(x)d+a, 


where A satisfies (A). 
(ii) Let G(+) be a commutative semigroup and H (-) be a group. The general sys- 
tem of solutions f, g,h: G — H of (PE) is given by 


fiaty)=abE(x+y)bd, g(x) =aE(x)b, h(x) = bE(x)d, 


where E is a solution of (E). 
(iii) Suppose f,g,h : G(-) > H(+) (where Gis a commutative semigroup and H 
a group) are solutions of (PL). Then there exists an L satisfying (L) such that 


f(xy) =a+b4+L(ry)+b+4+d, g(x) =a4+La)+b, h(x) =b4+L(x)+d. 
(iv) The general system of solutions f, g,h : G(-) > H(-) (where G is a commu- 
tative semigroup and H a group) satisfying (PM) is of the form 
f(xy) = abM(xy)bd, g(x) =aM(x)b, h(x) = bM(x)d, 


where M satisfies (M). 
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1.6.3 Some Generalizations 


Proposition 1.58. Let us first consider the simple functional equation(s) 


(1.87) fatty =fRe)+FOM+F@OSFO), 


where f : S — F, S(+) is a groupoid, and F is a field. Then 


f@)=E(x)-1 or f@)=1-E() 
according to whether a + or — sign is taken in (1.87), where E satisfies (E). 


Proof. When a + sign occurs on the right side of (1.87), define E(x) = 1+ f(x) 
and add | to both sides; when a — sign occurs in (1.87), add —1 to both sides and 
define E(x) = 1 — f(x). Then (1.87) reduces to (E). 

For S = R = F with f continuous at a point, the solution of (1.87) is given by 


f(@)=e*—-1 and f(x)=-1 
or 


f(@~)=1-—e* and f(x)=1 


according to whether a + or — sign occurs in (1.87). 
The continuous solutions of 


(1.88) f(r +x2,y1 + y2) = filv1, 1) + far, y2) + fa(%2, v1) + fara, y2) 


are obtained in Kuwagaki [591]. In Theorem 1.59, we obtain the general solution of 
(1.88) in a general setting related to the Pexider equation and a biadditive function. 
Note that this equation has five unknown functions. 


Theorem 1.59. The general solution f, fj: H x H > G (i = 1, 2,3, 4) of (1.88), 
where H is a groupoid with neutral element 0 and G is a commutative group, is 
given by 


f@, y) = Bx, y) + Aix) + AY) +, 
fi, y) = Bx, y) + ui) +01(Q) — 1, 
(1.89) fo, y) = B, y) + Aix) — wi) + 02(y) +1, 
fax, y) = B@, y) + u2(x) + Aa(y) — 019) +1, 
fa(x, y) = BQ, y) + A(x) — u(x) + Ao(y) — v2n(y) te —c1, 


where B is biadditive (that is, B : H x H — G is additive in each variable), A,, A2 
are additive (satisfy (A) on H), uy, u2, 01, v2 are arbitrary functions, and c, c\ 
are arbitrary constants. 
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Proof. Setting x; = 0 = x2 = y; = y2 in (1.88), we have 


4 
f0,0) = >) fi,0). 
i=l 


With y; = 0 = yo, (1.88) results in the Pexider equation (PA) 
f(%1 + x2, 0) = (ft + f2) 1, 0) + (f3 + fa) (x2, 9), 


and by Result 1.56 (a), there exists an additive A; : H — G and constants A; and 
b, such that 


f(x, 0) = Ai(x) +a, + b1, 
(1.90) (fi + f2)(x,0) = Ai) +a), 
(f3 + fa)(x, 0) = A1(x) + dy. 


Similarly, setting xj = 0 = x2, (1.88) results in a Pexider equation (PA) such that 


fO, y) = A2(y) +. a2 + bo, 
(1.91) (fi + f3)0, y) = A2(y) + a2, 
(fo + fa), y) = A2(y) + b2, 


where A : H — G satisfies (A) and a2, b2 are constants. 
Putting sy = 0 = y2, x2 =O = y1, x1 =O = yy, and x; = 0 = yp separately 
in (1.88), we have 


fi@i,y1) = f@1, y1) — fo, 0) — f3@, y1) — fa, 0), 
fo(x1, y2) = f 1, y2) — fii, 0) — fa, y2) — f2(0, 0), 
fa(x2, yo) = f 2, y2) — fa(x2, 0) — fo(O, y2) — fi, 0), 
f3(x2, v1) = f 2, v1) — fa(x2, 0) — fi, y1) — f2(0, 0). 


Substituting these values of fi, f2, 3, 4 into (1.88) and using (1.90) and (1.91) 
yields 


(1.92) 


f (1 +2, 91 + yo) = f (x1, 1) + f 1, y2) + f 2, yi) + f (2, yo) 
— f (0,0) — Ay (x1) — ay — Ai (2) 
by — A2(y1) — a2 — A2(y2) — bp. 


Noting that A; and A? are additive and f (0,0) = aj + b} = a2 + b2 = c (say), the 
equation above can be rewritten as 


B(x, + x2, ¥1 + y2) = B(x, y1) + B(x2, y1) + BO, x2) + B(x2, y2), 


where 


(1.93) B(x, y) = f@, y) — Aix) — Aa(y) -c. 


It is easy to check that B is biadditive (that is, additive in each variable). 
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From (1.93), (1.92), (1.90), and (1.91), we obtain the solution of (1.88) given 
by (1.89), where ui (x) = fi(x,0), o1() = fi, y), 02) = fp, y), v2) = 
f3(x, 0). 


Remark. Suppose H = R = G in Theorem 1.59. If any one of the functions, say 
jf, is continuous at a point, then the solution of (1.88) is given by 

f(x,y) =axy +aixt+ay +e, 

fi@, y) =axy +ui(x)+ v1(y) —¢, 

fax, y) = axy + ax — uy (x) + 02(y) + ¢1, 

f3(%, y) = axy + u2(x) + ary — v1 (y) +1, 

fa(x, y) = axy +a x — u2(x) + ary — v2(y) +e — cy. 


1.6.4 Some Special Cases 
Let us now consider the functional equation 


(1.94a) f(x +y) = g@&)h(y) + kG). 


This equation contains (A) (when h = 1, g = k = f/), (E) (whenk = 0, g = 
h = f), (PA) (h = 1), and (PE) (k = 0). The solution of (1.94a) can be obtained in 
relation to the equations 

(1.94a1) Fax t+y)= Fly) + G(x)h(y), 

(1.94a) G(x + y) = G@)(y) + Gly) 


(refer to Aczél [12], Aczél and Daréczy [43], and Castillo and Ruiz-Cobo [144]). 
Suppose f, g,h,k: S > H satisfy (1.94a), where S(+) is a commutative semi- 
group with identity 0 and H is a field. 


(1.95) Either g = 0 orh = 0 in (1.94a) leads to f = constant = k(0) 


(put y = 0). Assume g 4 Oandh #0. 
Now f(x) = c1 gives g(x)h(y) + k(y) = c1; that is, 


(1.96) g(x) =c2, ky) =c1 —c2h(y), is arbitrary. 


Let us now assume that f ¥ constant. 
Substitute x = 0 in (1.94a) to obtain 


(1.97) f(y) = gO)h(y) + k(y) fory eS. 
Subtracting (1.97) from (1.94a), we have (1.94a), where 
(1.98) F(x) = f(x) — Ff), G@) = g(x) — g(0). 


First we determine F and G from (1.94a;) and (1.94a2) using associativity. 
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Considering F(x + y +z) as F((x + y) +z) and F(x + (y + z)) (that is, using 
associativity) and using (1.94a1), we get 


(1.99) (G(x + y)— GQ))h(z) = G(x)h(y +z) forx,y,z€S. 
Fixing zo in (1.99) such that h(zo) 4 0, (1.99) becomes (1.94a2), where 
$(y) = hy + zo)h~! Go). 
Note that 6(0) = 1. From (1.94a2), we have 


G(x +yt+z)=G& + y)b(z) + Gz) = [G@)d(y) + GO) ]b(z) + GE) 
= Gx)o(y +z) + GIy) (z) + GC); 


that is, Gx)[P(y + z) — 6) O(@)] = 0. 

Now G(x) = 0 implies F(x) = constant (from (1.94a;)); that is, f = constant, 
which is not the case. 

Thus ¢ is exponential and satisfies (E), say ¢ = E. Interchanging of x and y in 
(1.94a2) yields (note that ¢ = E) 


(1.100) G(x)(E(y) — 1) = G(y) (E(x) — I). 


Consider the case E(x) = 1. 

Equation (1.94a2) shows that G is additive, say G = A, g(x) = A(x) +c? (use 
(1.98)), and that (1.99) gives h = constant = c3. Again (1.94a;) with y = 0 gives 
F(x) = c3G(x) = c3A(x) (note F(0) = 0) or 


(1.101) f(x) =A)c3 + ¢1. 

Substituting this f in (1.97) yields 

(1.10la)  k(y) =c3A(y) +c} —c2c3. Note thath = c3, g(x) = A(x) + ¢2. 
Finally, consider the case E(x) 4 1. Use (1.100) to get G (and thus g); that is, 
(1.102) g(x) =c(E(x) — 1) +02, c £0. 


(c = 0 gives G = 0 and that F in (1.94a,), so f is constant.) 
Now y = 0 in (1.94a1) gives F(x) = c3G(x) = c3c(E(x) — 1); that is, 


(1.102a) f(x) = cc3(E(x) — 1) 4+ ¢1. 

Using (1.102) and (1.102a) in (1.94a1), we get (specializing x) 
(1.102b) h(x) = c3E (x). 
From (1.102a), (1.102b), and (1.97), we get 


(1.102c) k(y) = (cce3 — c2¢3)(E(y) — 1) +1 — €203. 
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Theorem 1.60. The most general systems of solutions of (1.94a) on S are given by 
(1.101), (1.101a), (1.102), (1.102a), (1.102b), (1.102c), (1.96), and (1.95), where E 
satisfies (E) and A is a solution of (A). 


Remark 1.60a. As for (1.94a1), in addition to the trivial solutions G = 0, h arbi- 
trary, F = constant and G arbitrary, h = 0, F = constant, the solutions of (1.94a1) 
are given by 


G(x) = A(x), F(x) = 3A) +1, bh) = 03, 
or 


G(x) =c(E(@) — 1), F@) =cce3(E(x) — 1) +.¢1, hQ@®) = c3E(X), 


where E and A satisfy (E) and (A), respectively, on S. 


Remark 1.60b. Barring the trivial solutions G = 0, ¢ arbitrary or 6 = 0, F = 
constant of (1.94a;), the most general solutions of (1.94a2) are given by 


$=1,G(x) =A) or $@) = E(x), G(x) =c(E@) - 1), 


where A and E are solutions of (A) and (EB), respectively. 


The solutions above follow from the proof of Theorem 1.60. We also provide a 
direct proof of (1.94a1) starting from 


(1.99) [Ga+y)—Gy)]h@) =G@)hQ +z) forx,y,zeS. 
Since h # 0, we obtain 

(1.991) G(x + y) = G(y) + G(x) - BQ), 

where 


By) = AQ + z0)h7! (zo) (put z = zo) 
= G(x) + G0) f(x), forx,y € S; 


that is, 
(1.9911) G(x)[B(y) — 1] = GO)IP@) - 1). 


Case 1. Suppose f(x) = 1. Then (1.991) shows that G is additive, say G = A. This 
G = A in (1.99) gives h = constant = c3. Use (1.94a1) to get 


Faty)= F(y)+c3A(x); 


that is, 
F(x) =c1+c3A(x) (put y = 0). 
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Case 2. Assume f(x) 4 1. Equation (1.9911) yields 
G(x) = c(B(x) — 1), 
which in turn in (1.991) shows that / is exponential, say 6 = E. These values of G 
and f in (1.99) give 
A(y +z) =AG)E(y); 


that is, 
h(Qy) =c3E(y) (take z = 0). 


Substituting these values of G, A in (1.94a,) yields F(x) = zc3(E(x) — 1). 


Remark 1.60c. Note that (1.94a) determines five unknown functions and no regu- 
larity condition is used (see also [12, 44, 818, 144]). 


Remark 1.60d. The commutativity of S is used only once to obtain (1.100). The 
results could be achieved by assuming that f satisfies the (K) condition. 


Corollary. The general nontrivial system of continuous solutions of (1.94a) for § = 
H = Ris given by 

f(x) = be3x +1, g(x) = bx +0, 

h(x) = 03, k(x) = be3x + cy — €203, 

fer=cae" =e. 6@)= ee" $3); 

h(x) = c3e'*, k(x) = (cc3 — c2c3)(e — 1) +c — €2€3, 


(1.94a’) and 


that of (1.94a2) is given by 
(1.94a‘) d(x) =1, G(x) =bx or d(x) =e, G(x) =cle* — 1), 
and that of (1.94a1) is given by 
G(x) = bx, F(x) =c3bx +e, h(x) =c 
(1.941) tor (x) ibe 3 I is 3 . 
G(x) = c(e* — 1), F(x) = cc3(e* — 1) +. €1, h(x) = c3e?*. 


Sometimes one encounters the dual of (1.94a), namely 


(1.94b) f(xy) = g@)h(y) +kQ), 


where f,g,h,k : S — H, S(-) is a commutative semigroup with identity e and H 
is a field. This equation contains (L), (M), (PL), and (PM). From Theorem 1.60, we 
deduce the following result. 


Result 1.61. The general system of solutions of (1.94b) is given by (1.95), (1.96), 
and (1.94b’) 


f@)=c3L(@%) +1, g) = Le) +03, 
h(x) = 3, k(x) = c3L (x) + c1 — €2¢3, 


f(x) = ce3(M(x) — 1) +c1, g@®) = e(M(x) — 1) +02, 
h(x) = c3M(x), k(x) = (ec3 — c2¢3)(M (x) — 1) + c1 — €20€3, 


(1.94b’)  } and 
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where L (logarithmic) is a solution of (L) and M multiplicative is a solution of (M) 
on S. 


Note that Result 1.61 also gives the solution of 
(1.94b)) Fry) = gQ)AG)+k(x), forx,y €S, 


as (1.94b’). 
For S = R, H = R, the system of solutions of (1.94b) for which / is strictly 
monotonic is given by 


f(x) = c3bk log gx +c), g(x) = bk log gx +63, 

h(x) = 03, k(x) = bk logx +c, — c203, 

f@) =cae -—I+e1, g&)=c@?-N+e, 

h(x) = 03x, k(x) = (cc3 — c2€3)(x® — 1) +c] — €2¢3. 


(1.94b”) and 


Remark 1.62. The general system of solutions of 


(1.94a3) wx + y) = w(x)a(y) + BY) 


where y,a, 2 : S > H is given by 


a=0, y=c=8, 
y(x) =c, a arbitrary, B(x) = cl —a(a)), 

(1.94a4) y(x) = A(x) + bi, a(x) = 1, B(x) = AQ), 
y (x) = c(E(x) — I +¢1, a(x) = EQ), 
By) = (ec —e1)(EQ) - 1), 


or 


respectively, where E satisfies (E) and A satisfies (A) on S. 


The general nontrivial system of continuous solutions of (1.94a3) for S = R = H 
is given by 


w(x) =bx +b, a(x) = 1, B(x) = dx, 


(1.94a’ or ; 
w(x) =c(e* —1)+c1, a(x) =e, B(x) = (c— c1)(e% — 1), 


respectively. 
Let us give an application of (1.94a3) in the theory of mean values and operations 
(see Aczél [12], p. 152). We start with the functional equation 


(1.103a) f(ax + by +c)=ge(x)+hO)+c’, withab £0, 
and the general linear equation 


(1.103b) flax +by +c) =a' f(x) +b' f(y) +c’, with aa’bb' 40 
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(refer to Aczél [12], Daréczy [191, 192], Kuczma [558], Losonczi [613], and Castillo 
and Ruiz-Cobo [144]). 

Let f,h, g : Fi — Fo, where F and F> are fields. If in (1.103a) either f, g, or 
h is a constant, so are the others. So, we will consider only nonconstant solutions of 
(1.103a). 

Putting y = 0 and x = 0 separately in (1.103a), we get 
(1.104a) flaxtc)=ge(x)+hQO)4+c’, a0, 
(1.104b) fly+co)=g0)+hQy) +c’, b#0. 
Note that f(c) = g(0) + h(0) +c’. 

Substituting (1.104a) and (1.104b) into (1.103a), we obtain 

flax +by +c) = flax t+e)t flbyto)—f@, ab #0; 

that is, 


fa+ytey=f@+c)+fOtc)— fl). 
Hence A(x) = f(x +c) — f(c) satisfies (A), so that 


(1.105) f(x) = A(x) + f(c) — A(c) = A(x) + f (0). 
From (1.105), (1.104a), and (1.104b), we have 
(1.106a) g(x) — g(0) = A(ax), h(y) —h(0) = Alby), a,b 0. 


If we consider (1.103b) (take g(x) = a’ f(x), h(y) = b’ f (y)) as before, we obtain 
(1.105) and 


(1.106b) a’ A(x) = A(ax), b'A(y) = A(by), aba’b' £0. 
Thus we have proved the following theorem. 


Theorem 1.63. Let f,g,h : Fi — Fy. Then the general nonconstant solutions of 
(1.103a) and (1.103b) are given by (1.105) and (1.106a) and (1.105) with (1.106b), 
respectively, where A is a solution of (A). Furthermore, if F, = R = F2 and any 
one of the functions is continuous or continuous at a point or monotonic, etc., then 
the solution of (1.103a) is given by 


(1.103a’) f@)=dx+a1, g(x) =dax +02, h(y) =dby +43, 
with dc + a, = a2 +43 +¢, and that of (1.103b) is given by 
(1.103b’) f(x) =dx+awitha=a', b=b', dc—c' =(a+b—|)a, 


with d # 0. From (1.106b) follows a = a’ and b = b’. As a matter of fact, we 
also have the following from (1.103b): c= c', a=1=b=a' =D’ gives (A); 
a+b=1,c=0=C' gives f(ax+(1—a)y) =af(x)+U-a)fiy) @= 5 
gives Jensen’s equation); anda+b=1, c #0 gives f(ax+(U-—a)y+c) = 
af (x) +(—a)fQ) +e. 
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We will now consider the following functional equations arising in the theory of 
mean values using (1.94a3) and (1.103b): 


sates | 2 (qtespesuen) = a +t, 
. F(g(x)) =x, 
af(xt)+bf(yt)\  — (af(x) +f) _ 
(1.107b) p (Peron) =. (Oe), f(g@)) =x. 


Suppose f is a strictly monotonic surjective continuous mapping on R+ satisfying 
(1.107a) for x > 0. Note that g = f~! and the codomain of f is a nondegenerate 
interval, say J. 
With f;(x) = f(@ +1), gr(x) = g(x) —1, for fixed t in R4_, (1.107a) goes over 
to 
F eS = oe 
“| eS lee ee 


a+b as ). fulgr(x)) =x. 


By hypothesis on f for u,v € J, there exist unique x, y € R + such that f(x) = u, 
g(y) =0, x = g(u), y = g(v), are satisfied so that the equation above becomes 


say ob) = manonan 


which is of the form (1.103b), where f is replaced by fig. 


Remark. The continuous solution of the Jensen equation (J) is given by f(x) = 
cx +b (see Theorem 1.52). The same holds for (1.107a1) (fg) (ape) 
a(fighw+big)l| 

a+b 


, a,b > 0. Hence 


(fig)(u) = c(t)u + a(t); 


that is, 


fatt)=cf@)+ a(t), 
which is (1.94a3). Thus follows the result. 


Result 1.64. The general strictly monotonic continuous solutions of (1.107a) are 
given by 


(1.107a’) f@)=dx+a and f(x)=de* +a. 
Similarly, for (1.107b) the dual solutions are 


(1.107b’) f(x)=dlogx+a and f(x)=dx°+a (c,d#0). 
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1.6.5 More Generalizations 


The Mikusitiski-Jensen functional equation 


(M-J) f (- a ) Ee + £0) 2h (=) =0, 


2 
where f : R— Ror f : (a,b) — R, is a special case of the functional equation 


(A-G) f(x) f(px + (1 — p)y)+ fO) f(A — p)x + py) 
= f (px + (1— p)y)’ + f((l — p)x + py)’ 


(for p = 5). which was introduced by Alsina and Garcia-Roig [71] in connection 
with the classical problem of duplicating a cube. We will deal with (A-G) later in 
the application. We now treat (M-J) and relate it to (1.59), the Mikusinski functional 
equation, and produce the following result due to Lajko [599]. 


Theorem 1.65. Jf the mapping f : R — R satisfies (M-J), then there exist an 
additive A and a constant b such that 


(SM-J) f(x) =AQ)+b forx ER. 


Proof. As remarked by Lajko, a proof can be presented based on a result of Baron 
and Ger or Ger in connection with a generalization of (1.59). 


Case f (0) 4 0. With y = —x, (M-J) goes over to 
f(x) + f(-—x) -—2f(0)=0 forx eR. 
The function h : R — R defined by 
h(x) = f(x) — FO) 
is odd, h(O) = 0, and (M-J) becomes 


E (=) + r00)| ie +h(y) — 2h (=) =0 forx,yeR. 


Replacing x and y by —x and —y in the equation above and adding the resultant 
equation to the equation above, using h odd and f (0) 4 0, we get 


h(x) +h(y) = 2h (=) forx,y ER, 


which is from Jensen [394]. Thus we get (SM-J) with b = f(0) 4 0. 
Case f (0) = 0. Equation (M-J) can be rewritten as 


x+y P| 
(5 ) [tora (2 )|=o 
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by replacing x and y by x + y and 0, respectively. Thus, if f (=) # O, then 
f@ty) =2f (=) # 0, and it follows from (M-J) and f(x + y) # 0 that f 
satisfies (A); that is, (1.59a), which is equivalent to (1.59). Then, by [218, 221, 232], 
there exists an additive A : R — R such that f(x) = A(x); that is, (SM-J) holds 
with b = 0. 

The following result also holds (Lajko [599]). 


Result 1.66. [f f : R > R* (or f : (a,c) C R > R*) satisfies (M-J), then (SM-J) 
holds where b # 0. 


1.7 Extensions 


Extension problems arise in many branches of mathematics; for example, extensions 
of homomorphisms and isomorphisms are known in algebra. These can be consid- 
ered extensions of functional equations and arise in many applications in the study 
of functional equations. The problem is that one is interested in either extending 
functions satisfying a functional equation on a “smaller” or “restricted” set to func- 
tions that satisfy the same equation on a “larger” or “maximal” set or proving that 
the given functions satisfying a functional equation on a “smaller” or “restricted” 
set indeed satisfy the same equation on a “larger” set. The next question is, when 
the extension exists, is it unique? One can cite the following well-known example. 
Suppose f : T > R(T = [0, co] or [0, 1]) satisfies (A) (in the case of [0, 1], (A) 
holds when x + y also is in [0, 1]). Is it true that there is an additive A : R — R that 
is an extension of f? The answer to this question is affirmative, and the extension 
is also unique (as we will show later). But, in general, there may not be an exten- 
sion, and when it exists, it need not be unique (see Section 1.6.5). We illustrate these 
extensions in the results to follow. We start off with the following theorem related to 
the example above. 


Theorem 1.67. (Aczél, Baker, Djokovic, Kannappan, and Rado [36]). Let G(-) and 
H (0) be groups (that need not be Abelian), S be a nonempty subsemigroup of G 
such that G = SS! = {st~!: s,t € S}, and f : S > H be a homomorphism; that 
is, 


f(st)= f(s)o f(t) fors,téS. 
Then there exists ahomomorphism g : G — H that is a unique extension of f. 


Proof. Define g : G > H by 


g(st') = f(s)o f(t)! fors,t eS. 


This g is well defined, for if st~! = uv7', for u,v,s,t € S, then s = uv~'t and 
there are r, w € S such that vo!t =rw! andtw = vor, Sw = ur, so that f(t) o 
f(w) = f(v)o f(r) and f(s)o f(w) = f(u)o f(r). Hence f(s)o f(w)o f(r)! = 
fu), and f(w)o f(r)! = f(t)! 0 f(v); thatis, f(s)o f(Q7! = flu)o fey}. 
That is, g is well defined. 
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We will now prove that g indeed is a homomorphism. Let x, y € G. Then there 
are s,t,u,v,r, w € S such that 


x=uo lt, y =rs',v 'r=tw ! and f(r) o fw) = fv) o f(t). 


Now, 


g(xy) = g(uv!rs~!) = gut: (sw)7!) = flut)o f(sw)! 
= flujo f(t)o f(w)'o f(s)! = flu) o f(v)' 0 f(r)o f(s)! 


= g(x)o g(y); that is, g is ahomomorphism. 


Further, for s,t € S, g(s) = g(st-t7!) = f(st)o f(t)~! = f(s) shows that g is an 
extension of f. Finally, there is the uniqueness of g. Suppose / is also an extension 
of f. Then 


A(st—') =A(s)oh(t)"' = f(s)o fF)! = g(st!). 


This proves the theorem. 


Remark 1.68. If G and H are Abelian groups, the proof is even simpler (see [53]). 
Also when G = H = R(+), S = [0, oo[ (the example cited before Aczél and Erdés 
[45]), every function f : [0, co[— R satisfying (A) can be extended to a solution A 
of the same equation on the whole plane, A(x + y) = A(x) + A(y), for (x, y) € R?, 
so that A(x) = g(x), for x > 0. 


The same result can also be obtained from the following result. 


Result 1.69. (Aczél, Kannappan, Ng, and Wagner [53]). Let G(-) and H(o) be 

groups and S be a subsemigroup of G such that every element x of G different from 

the neutral element of G belongs either to S or x~! € S (or both). Then every 

homomorphism f : S — H has a unique extension to a homomorphism g : G > H. 
Note that S generates G and G = SS~'. 


Now we will see an example of a homomorphism on a group finitely generated 
by a subsemigroup. 


Theorem 1.70. (Mokanski [637]). Let G(-) and H(o0) be groups and S a subsemi- 
group of G such that G = SS~'S. Ifa mapping f : G > H satisfies the equation 


flab“! -c) = f(ab“)o f(c), forab~! € SS~! andc € Sora,b,céS, 


then 


fay)=f@)of) forallx,yeG. 
Proof. First we will show that 
flab) = fa@ofb) fora,be S; 


that is, f is ahomomorphism of S into H. 
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For a,b € S, by the definition of f, 
f (ab) = f(abb~'.b) = f (abb™') 0 f(b) = f(a)o f(b). 
Also, f (ab) = f (ab~! - b?) = f (ab~!) o f(b’), so that 
flab) = fajo f(b)! fora, be S. 
Note that f(ab~!c) = f(a)o f(b)“!0 f(©) fora, b,c ES. 


Let x,y € G. Then there exist a,b,c,u,v,w,r,s,t in S such that x = 


ab~'c, y =uv—!w, b~'cuv—! = rs—'t, so that cu = brs—! - tv and 


f(cu) = flbr)o f(s)! 0 ftv) = fb)o fr)o f(s)! 0 fo f(r). 
Now, 


f(xy) = f(ab'c . uv !w) = f(ar got tw) = f(ar)o fish" o f(tw) 
= f(a)o f(r)o f(s)! o fo fw) 
= f(a)o f(b)! o fo fluo fv)! 0 fw) 
= f(x)o f(). 


This proves the theorem. 


Before discussing more results on extensions, we will provide examples to show 
that an extension may not be possible, and when it exists (is possible), it need not be 
unique. 


Examples 


(1) Let f : Ry — R such that 


fx) = & for x > 0, 


, forx=0. 
Obviously, f(x + y) = f(x) f(y) for x, y € Ry. This f has no extension E to 
R satisfying (E) on R (whenever E is zero at a point, then E is identically 0 in 


R). 
(2) Let f : Ri. > R bea nonzero solution of 


f(xy) = f@)+ fQ). 


This f has no extension to R+ or R (such an extension is identically zero on Ry 
or R). But this f has an extension to R*, 


(ie f(x), for x > 0, 
Pe Bet. Pata 4, 
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(3) Let g: R > R be defined by g(x) = x° for x > 0. Then g(xy) = g(x)g(y) for 
x,y € R4, and this g has two extensions hj, h2 to R, namely hj (x) = |x|° for 
x ER; ho(x) = (sgn x) |x|° ,x € R (141 |Ry = g and h2|R+ = g). 


Now we will consider more results on extensions. 


Result 1.71. [229, 558, 95]. Suppose G and H are Abelian groups with H divisible 
and S is a subsemigroup of G. Let f : S — H bea homomorphism. Then there 
exists ahomomorphism h : G — H such that h|S = f. 


Remark 1.71la. This result fails to hold if H is not divisible. For example, let 
G = H = Zand S = 2Z. Then Z is not divisible and f : 2Z — Z defined by 
f Qn) =n forn € Z is a homomorphism with no extension h : Z — Z (such an 
extension has to satisfy h(3) + h(3) = h(6) = f(6) = 3, and 2A(3) = 3 has no 
solution in Z). 

Further, the Aczél and Erdés [45] problem has a solution. Also, if S does not 
generate G, then the extension / need not be unique [229]. 


1.7.1 Extension of the Additive Function on [0, 1] 
Problem. f : 7 — R such that 


(1.108) faty=f@)+fQ), forx,y,x+yel, 


holds. Does there exist an additive A : R — R satisfying (A) such that A|J = f? 
If it exists, is A unique? 

These types of problems arise in information theory, statistics, and other fields. 
The answer to the problem above is affirmative and was first proved by Daréczy and 
Losonczi [203]; also see [43, 55]. Here we present a slightly different proof, first by 
extending f to [—1, 1] and then to R. 

Define g : [—1, 1] ~ R by 


f(x), for x € [0, 1], 
gx) = 
—f(-—x), forx € [—1,0]. 


First note that g(—x) = —g(x) for x € [—1, 1] and g(0) = 0. We will show that 
(1.108a) gx +y)=8@)+8(), forx,y,x+ye[-l, 1. 


Obviously (1.108a) holds for x, y > O withx + y € J. 
Letx <0, y <Owith—-1 <x+y <0. Then 


g(x +y)=—f(-x —y)=—-(f(-«) + f(-y)) = g@) + 80). 


Letx <0, y > 0. Then two cases arise according to whether x+y <Oorx+y > 0. 
First take the case x + y < 0. Then 


gix)=gaty—-y)=gety)t+ea(-y)=et+y)—-gQ). 
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Now consider the case x + y > 0. Then 


gy)=aty-x)=g@ty)t+a(-x)=g(x+y)—g). 


The cases where x > O and y < 0 with x + y € [—1, 1] can be treated similarly. 
Thus (1.108a) holds in a “hexagonal” or “diamond” neighbourhood of (0, 0). This g 
is unique. 

Now we extend g on [—1, 1] to A on Ras in Aczél and Losonczi [55]. 

Define A : R > R by 


A(x) =ng (=), forx € R, ne Zi, at 


This A is well defined since, for = € [—1,1], n- =, m- = are in [—1, 1] and 
g(nt) = ng(t) fort, nt € [—1, 1]. Now 


ne() =nne (i) =nme (22) ne (2) 


Also A(x) = g(x) for x € [—1, 1] (taken = 1). 


Finally, for x, y € R, choosen € Z such that =, 2, 


n?n? n 


Atty) ane (* =") =n[g (=) +8(=)| = A(x) + A(QY). 


Further, A is unique, for if A; is another extension on R of g on [—1, 1], then for 
arbitrary x € R, n € Z+, with * € [-1, 1], it follows that 


are in [—1, 1]. Hence 


Ai(x) =nAy (=) =ng (=) = A(x) forxeR. 


This confirms that the extension is unique. 


Remark 1.72. The result above holds on any interval [0, s]; that is, if f : ]—-R 
satisfies (1.108) for x, y,x + y € [0,5], then f has an additive mee on oR. 


Remark 1.73. Unique (additive) extensions exist in the following cases, too. If f : 
]-—r,r[— R( > 0) is additive either on H, = {(x, y): x,y,x +y €]-—r7rh 
(hexagonal neighbourhood of (0,0)) or on K, = {(x, y) : x? 4 y? < r?} (circular 
neighbourhood of (0, 0)), then there exists a unique A : R — R satisfying (A) such 
that A is an extension of f (see [203]). 


We will see the use of these in applications later. Now we use then to prove the 
following result arising in information theory. 

First we prove the following extension result similar to the problem discussed 
earlier, but this time the domain is the open unit interval as follows. 


Theorem 1.74. Let f : ]0, 1[ > R satisfy 


faty)=f@)+ f(y) forx,y,x+y €]0, IL. 
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Then there exists a g : [0,1] > R satisfying 


g(x+y)=8)+e() forx,y,x+ye[0, 1] 


such that g\}0, 1[ = f. Thus there exists an additive A : R — R that is an extension 
of f. Clearly this extension is unique. 


Proof. First note that kf (t) = f (kt) for t, kt in ]O, 1[, where k is a positive integer. 
For integers n,m > 1, claimnf (+) =mf (+) ; 


ate Dice ae ab ail : 
Indeed, since — aT aN aa GaN zz are all in JO, 1[, 


f(x), for x € ]0, I[, 
g(x) = 49, for x = 0, 


nf (4). forx =1. 


Claim g(x + y) = g(x) + g(y) forx,y,x+yin/. 
For x, y,x + y in JO, I[, it is true. 
Forx =0, y € JO, 1[, x + y € ]O, 1[, and 


g(x +y) =g(y) = 80) +80) = g(®) + 8). 


Forx =0, y=1, g@+y)=80) = 8(0)+ 80) =8@) +80). 
Boe for x, y in JO, 1[ with x + y = 1, choose a positive integer n such 
y + , * * c+ , 3 ) + M 
that +, +, and = are in ]0, I[. Since f (=) = f (+) + f (4), nf (=) = 


n? n? 
nf (£) +nf (2) and g(x+y) = g(1) = nf (4) =nf (4) +nf (2) =8@) +80). 
Thus g extends f on ]0, 1[ to [0, 1]. So, by the previous result, there is an additive A 
on R that extends g on [0, 1] and hence extends f on ]0, 1[. 


Remark 1.75. The result above holds on any interval ]0, s[; that is, if f :]O,s[ > R 
satisfies f(x + y) = f(x) + fQ) forx, y,x + y € ]0,s[, then f has an additive 
extension on R. 


Theorem 1.76. The functions f; : ]0,1[ — R satisfy the functional equation 


(1.109) > fii) =0 (° <pr<1, > p= ) 
i=l 


i=1 


for arbitrary (but fixed) n > 3 if, and only if, there exists an additive A: R > R 
such that 


(1.110) filx) = A(x) +b; forx €]0, 1, 


n 
where bj (i = 1,2,...n) are constants with AA) + > bj = 0. 
i=1 
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Proof. We choose a € ]0,1[ such that (n — 3)a < 1, b = 1 — (n — 3)a. Putting 
Pl=xX, po=y, pp=b—-x—Yy, p4=--++ = Pn =a in (1.109), we obtain 


A@)+ho)+ BO-x-y)=k forx,y,x+y € ]0, dL, 


where k is a constant. Interchanging x and y, we have fj(x) — fa(x) = fily) -— 
f2(”) = c (say), so that the equation above can be written as 


(1.111) fi@)+ fi0) -—e+ falb-—x —y)=k, 
(1.90) 


with & a constant, for x, y,x + y in ]0, b[. 
Forx,y,z,x +y+z € ]0,b[, (1.111) yields 


Boas = 7 =z) =k PeHS fiery) = fi) 
=k+c— fix) — fi +2); 


that is, fi(x + y) — fi@) = fi +y) — fi) = d(y) (say). 
Hence, 


fi +y) = fi) + dy) = fi) +4) 
so that f(x) — d(x) = c, and 


A@t+ty=fiC)+fiO)—e¢ forx,y,x+y € 0, bl. 


Thus, by Remark 1.75, there exists an additive A : R — R such that 
fi) = AQ) +41. 


Since f; is arbitrary, the form above holds for all fj (@ = 1,2,...n) forx € 
[0,b]. Thus fix) = Aw) + 5; @ = 1,2,...n). Now, given x € ]0, 1[, we can 
always find a “b” satisfying the condition above (A and b; are independent of b 
because of the additivity of A). Hence the solution of (1.109) is given by (1.110) for 

n 
x € ]0, I[ with AC) + 3° 5; = 0. 
i=1 

This proves the theorem. 


1.7.2 Quasiextensions 


As pointed out in the examples, there an additive extension may not exist even in 
some very simple cases. Suppose 


5x+2, forx€]1,4[, 
(1.112) f(x) = 45x43, if5 <x <7, 
5x+5, if6<x <1. 
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Then the function f : Dr = ]1,4[ U ]5,7[ U ]6, 11[ > R is additive for (x, y) € 
D = )1,4[ x ]5,7[, but there is no additive A : R — R that extends f (that is, 
Alp; = f) for such an A would be continuous on Dy and A(x) = cx, for x € R, 
which contradicts A|p, = f. 

Nevertheless, A(x) = 5x is a quasi extension of f; that is, A differs from f on 
J1,4[, ]5, 7 and ]6, 11[ by a constant and each is additive on R. 

However, if we let 


f(z) =5z+5  forz € 6, 11[, 
(1.112a) g(x) =5x+2 whenl <x <4, and 
h(y) =5x+3 ify €)5,7L 


then 
f@+y)=g@)+hA(y) for, y) €]1,41 x J5, 71, 


and there exist unique functions F, G, H : R > R such that 
Fly =f, GlIN,4l= 98, AlN5,71=h, 
and 


F(x +y)=G(x)+ A(y) forall x,y € R. 


These lead to the notion of quasiextensions. Let X(+), K («) be groupoids and 
Y(+) be a commutative group. Given a nonempty subset D C X x X, we denote 
Dy, = {x : (x,y) € D for some y € X}, Dy = {y : (x,y) € D for some 
x €X}, Do={x+y:(,y) € D}, and Dj = Do UD, U Dy. 

We say that a function f is additive on the set Dif f : Dj — Y satisfies (A) for 
(x, y) € D.A function A : X — Y is called an (additive) extension of a function f 
on D if (A) holds for A on X and A|p, = f. 

If there is an additive F : X — Y anda point (a, b) € D such that 


fxt+ty)—f@—fb)=Fat+y)—F@—F) forx+ye Do, 
f(x) -f@ =F(x)- F(a) forx € Dy, 
f(y)— fo) = F(y)- F@)  fory € Dy, 
hold, we say that F is a quasiextension of f. 


It is easy to check that this definition is independent of the choice of (a, b) (for 
(c,d) € D, take x = c and y = d in the equations above). 


1.7.3 Extension of the Pexider Equation 
For f : Do > K, g: Dy > K,andh: Dy — K (where K (+) is a groupoid) such 


that 
fat+y)=g(x)*h(y) forall, y)eDCXxxX 


1.7 Extensions 61 


(that is, f, g, h satisfy a Pexider equation), if there are functions F,G,H :X > K 
with F|Do = f, G|Dy = g, H|\Dy =h, and F(x + y) = G(x) * H(y) for all 
x,y €X, (F, G, H) is said to be a Pexider extension to X of (f, g,h) on D. 

The function f in (1.112) has quasiextension to R, and (f, g, 2) in (1.112a) has 
Pexider extension (not only quasiextension) to R?. As for quasiextension, the basic 
results are from Daréczy and Losonczi [203], Riman [703], Aczél [30], and Rado and 
Baker [676]: If f is additive on the circular disk D = {(x, y): («—a)?+(y—b)? < 
r7}, then there exists a quasiextension A of f to R*; from any open connected set 
D in R? there exists a unique quasiextension of the Cauchy equation on D to R? 
[203]. Let D C R?” be an open, connected set, Dy = {x € R” : (x, y)} in D for 
some y € R", Dy = {y € R" : for some x, (x, y) € D}, Do = {z € R", z = 
x+y, (x,y) € D}, D’ CR" such that Dy, UD, UDo © D’. If f : D! > R satisfies 
the additive equation (A) on D, then there exists a unique additive A : R” > R and 
constants b,c € R such that 


f(x)=A(Qx)+b forxeD,, f(y) =A) +c fory € Dy, 


and f(x + y) =A(x+y)+b+c forx + y € Do hold. 

As regards Pexider extension, in general the extension may not exist. For exam- 
ple, if f,g:R— Randh : 2Z —> Rare defined by f(x) =k +sinzx, g(x) = 
sinax forx € R, h(2n) = k forn € Z, and k is constant, then f(x + 2n) = 
g(x) + h(2n) for all (x, 2n) € R x 2Z, but there is no H from R to R such that 
A\2Z =h and f(x + y) = g(x) + H(y) for all x, y € R (see [676, 30]). 

The following result is due to Baker and Rado [676]. 


Result 1.77. Let U be a nonempty open, connected subset of IR" x RR" and K be an 
additive Abelian group. Suppose f : Up > K, g: Ux > K, andh: Uy > K 
satisfy f (x +y) = g(x) +h(y) for all (x, y) € U. Then there exist unique functions 
F,G,H : R" = K that extend f, g,h, respectively, and F(x + y) = G(x) + H(y), 
forall x, y € R". Further, there exists a unique additive A : R" — K and constants 
a,b € K such that F(x) = A(x) +a+b, G(x) = A(x) +a, and H(x) = A(x) +b 
forallx € R". 


Recall that f, g, h satisfying a Pexider equation is called a homotopism (f, g, /). 
Regarding a homotopism (f, g, 4) of a semigroup S into a group K, we prove the 
following result due to Radé [675] which does not involve topology. 


Theorem 1.78. Let G1 (-) and G2(-) be groups and S a subsemigroup of G, such that 
G, = SS-! = S“'S. Let (f, g,h) be a homotopism of S into Gz. Then there exists 
a homotopism (F, G, H) from G, into G2 and a (multiplicative) M : G, > G2 
satisfying M(xy) = M(x)M(y) for x, y € G, and constants b,c € G2 such that 


F(x) =bM(x)c, G(x) = bM (x), H(x) = M(x)c, 


F|SS = f\|SS, GIS=g, and HA|S=h. 
Proof. Suppose f, g,4 : S > G2 satisfy f (st) = g(s)h(t) fors,t € S. 
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Define M : G; —> K by 
M(x) = M(st~') := h(s)h(t)!, 


where x = st—! for some s,t € S. First, we will show that M is defined unambigu- 
ously and then that M(xy) = M(x)M(y) holds for x, y € Gy. 

Suppose st-! = uv! and st! = wa r, for u,v,w,r € S. Then ws = 
rt, wu =rov, and 


f(ws) = g(w)h(s) = ft) = gh), 
f(wu) = g(w)h(u) = f(rv) = g)h(v). 


Eliminating g(w) and g(r), we have h(v)h(u)~'h(s) = h(t); that is, h(s)h(t)~! = 
h(u) -h(v)~!. So, M is well defined. 

Now, we prove that M(st) = M(s)M(t) and M(st~'!) = M(s)M(t)~! for 
s,teS. 


Fors,t € S, 
M(s) = M(st -t~') = h(st)-A(t)7!; 
that is, 
h(st) = M(s)h(t) fors,t € S. 
Forr,s,t € S, 


h(str) = M(st) -h(r) 
= M(s)-h(tr) = M(s)M(t)h(r); 


that is, M(st) = M(s)M(t), for s,t € S (M is multiplicative on S). 

Let uv~! =t7'r foru,v,t,r € S. Thenst—! = st—!rr-! = 5- uv—!-r7! for 
s,t,r,u,v € S.Now tu =ro, h(tu) = h(rv); that is, M(t)h(u) = M(r)h(v) or 
M(r)-!M(t) = h(v)h(u)7!. 


1 


M(st—!) = M(su - (rv)~') =h(su)-h(ro)7! 

= M(s)h(u)h(v)'M(r)! 

= M(s)-M(t)"'M(r)- M(r)7! = M(s)- M(t). 
Let x,y € Gy . Then there are v,5,t,u,0,w € S such thatx = rs‘, y = 
tu-', s~'t =v-w7!, andtw = sv, with M(t)M(w) = M(s)M(v). 

Finally, 
M(xy) = M(rs7! : tu‘) =M(r- vw tu!) 

= M(rv)-M(uw)7! 

= M(r)MQ) -M(w) Mu)! 

= M(r)M(s)* - MQ) - Mw)" = M(x) MQ). 
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If we define 
F(x) =bM(x)c, Gx) =bM(x), H(x)=M(x)Jc, forx € G1, 
then 
F(xy)=G(x)H(y) holdsfor x,y € G,. 

Fix fo,ro in S and take c = M(to)~'h(to) and b = g(ro)M(ro)~!. By the 
definition of M, we have, for s € S, h(s) = M(sty') -h(to) = M(s)c and 
H|S = h. Suppose st-! = r—'u forr,s,t,u € S. Thenrs = ut, f(rs) = 
g(r)h(s) = f(ut) = g(u)h(t), and h(s)A(t)~! = g(r)! g(u), so that M(r~!u) = 
M(st—!) = h(s)h(t)~! = g(r)! g(u). As before, M(r~!u) = M(r)~!M(u). Now 
g(u) = g(ro)M (ro) |-M(u) = bM(u) for uu € S and G|S = g. Finally, fors,t € S, 


f(st) = g(s)h(t) = bM(s)- M(t)c = bM(st)c = F(st) and F|SS = f\SS. 


This completes the proof of this result. 


We now extend functions satisfying the Cauchy and Pexider equations where 
one of the variables is contained in an arbitrary group G or a subsemigroup S of G 
and the other is confined to certain nonempty subsets C of G and the values of the 
functions are elements of a group H. 

Denote (C) = the subgroup generated by C. 


Result 1.79. (Mokanski [637]). Let G(-) and H(0) be groups and C a nonempty 
subset of G. Suppose f : G > H satisfies f (x-c) = f(x)of(c), forx €G, c EC. 
Then f(x -c') = f(x) o f(c’) holds for x € Gandc' € {C). 


Note that (C) is the maximal set for which f (xc’) = f (x)o f (c’) is valid, as can 
be seen from the following example. 
Take G = H = Z(+), C = {4}. Then (C) = 4Z. Set 


xe if x 1s even, 


FO) = : +1, ifxisodd. 


It is clear that f(x +4) = f(x) + f(4) for x € Z, 4 € C. Further, f(x + 4n) = 
f(x)+f (4n) forx € Z, 4n € (C). But8 = f(8) = f(5+3) - f(S)+fG) = 10. 
If (C) = G in the result above, then f(xy) = f(x) 0 f(y) forallx, y EG. 
Result 1.80. [637]. Let G(-) and H(0) be groups, S a subsemigroup of G, and 
CC S.ifg:S > H satisfies g(s -c) = g(s)o g(c), fors € S, c € C, then 
g(sc’) = g(s) 0 g(c’) is valid for s € S andc' € C’, the subsemigroup generated by 
C. Further, if G = SS~! and C' = S, then there exists a unique homomorphism 

from G into H such that $|S = g. 


The Aczél and Erdés result [45] can be deduced easily from this result also. 


Result 1.81. [637]. Let G(-) and H (0) be groups and S a subsemigroup of G such 
that (S) = G. Suppose f, g: G > H, h:S — H are such that f (xs) = g(x)oh(s) 
holds for x € G, s € S. Then there exists a homomorphism ¢ : G — H such that 
f(x) = ¢(4) 0b, g(x) = d(x) 04, h(s) = ay '(s) obforx €G, s € S,anda,b 


are constants in H. 
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Again the Aczél-Erd6s result [45] can be deduced from this result. 


1.7.4 Extension of the Logarithmic Equation 


Theorem 1.82. Let f : T — R, where T = Ror ]0,1] or [1,00[ or 
Ja, co[ (a > 0) satisfy f xy) = f(x) +f) forx, y € T. Then f has an extension 
L : R* => R satisfying (L). 

Proof. In the case T = R*_, L(x) = f (|x|) for x € R* serves the purpose. 

In the remaining cases, T is a subsemigroup of R* (-) such that R4_ = TT '= 
T~'T. Then, by Theorem 1.67, f has an extension (logarithmic) to IR*. and thus to 
R*. 

Without using this theorem, we give the following simple, direct proof for the 
case T = [1, oof. 

Let f : [1, oo[ > R satisfy the logarithmic equation (L). Define L : Ri. > R 
by 

f@), — forx=1, 
L(x) = 1 
(4), forx €10, 1). 


Clearly L (+) = —L(x) for x € R* and L(xy) = L(x) + L(y) for x,y > 1 and 
x,y €]0, 1]. 
Let x > 1 and y < 1. Two cases arise according to whether xy > 1 or xy < 1. 


When xy = 1, L(xy) = L(I) = f(1) =0=LO)4+L (+) = L(x) + LG). 
Let xy > 1. Then Lv) = L (xy : 1) = L(xy)+L (+) = L(xy) — L(y). When 
xy <1, LQ) =L (xy : 1) = Livy) +L (+) = L(xy) — L(x). The cases 


where x < | and y > 1 can be argued similarly. Thus L is logarithmic on R%_ and 
L| (1, col= f. 


As a consequence of Theorem 1.82, we obtain the following results. 


Theorem 1.83. A function f : [1, col > R satisfying 
(1.113) fin)=fO+Ff@, fort e[l, oof, ne Zi, 


satisfies the equation f (t*) = f(t) + f(*) fort, € [l,ool, m,n € Zi. 
Further, if f is continuous, then f (ts) = f(t)+ f(s) for all s,t € [1, col, and there 
exists a continuous logarithmic extension L :]0, co[ > R such that L(t) = f(t) = 
clogt fort € [1, of, where c is an arbitrary constant. 


Proof. Take any rational a in Jl, co[. Then m,n € Zi andm > n. Lett € J1, of. 
First, take t > n. Then f(t) = f (4-n) = f (£) + f(@), using (1.113), so that 


f (<) =fO-f@: £(F) = Foy - Fo, 
nN n 
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Further, 


m t t ‘ 
f (« “) = (- m) = (<) + f(m) (using (1.113) 
nN nN 
=fO-fa+ fo=sfoO+s (=). 
n 
Fort <n, there exists an integer k such that kt > n, and 
m m m 
f (=) = ren + 6 (F) = 104+ F045 (F) 
= f+ f(t). 
nN 


Thus f (¢-“) = f(t) + f (4) fort, * € [1, oof. 

If f is continuous, then f(ts) = f(t) + f(s), t,s € [1, co]. Further, there 
is a unique continuous L : Ri — R satisfying (L) such that L|[1, oo[ = f and 
f(t) =clogt, fort € [1, oof. 


Remark. [40]. If f is a strictly monotonic function satisfying (1.113), then also 
f(t) =clogt fort € [1, ov[. 


Result 1.84. The general solutions f, g,h :}0, 1] — R of the Pexider equation 
(1.114) f(xy) =s@)+h(y), forx,y €]0, 1), 


are given by f(x) = L(x) + a+b, g(x) = L(x) +4, h(x) = L(x) +), for 
x €]0,1], where L: Ri — R is logarithmic and a and b are arbitrary constants. 


Proof. Setting y = 1 and x = 1 separately in (1.114), we get f(x) = g(x) + a and 
f(y) =b+A(y), so that (1.114) becomes f(xy) = f(x) + f(y) — a —)b, which is 
logarithmic on ]0, 1]. The use of Theorem 1.82 yields the result sought. 


1.7.5 Exponential Extension 


Result 1.85. Let f : T > R*, where T = ]0, oo[ = R*. or Ja, ox[ (a > 0) satisfy 
f(x +y) = f(x)fQ). Then there is an E : R —> R* satisfying (E) such that 
E|T = f. 

Noting that T (+) is a semigroup of R(+) with R = T — T, use Theorem 1.67. 


Theorem 1.86. Suppose f : [0,1] — R* is such that 
(1.115) faety=f@FO) 


holds for x, y,x + y € I. Then there is an E : R — R* satisfying (E) such that 
E|l= f. 


Proof. We do the extension by two stages, first to [—1, 1] and then to R. 
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Define g : [—1, 1] ~ R by 


u for x € [—1, 0]. 


f(x),  forx e€ [0,1], 
s@)=y 1 
f(=x)? 


Note that g(—x) = AOE for x € [-1, 1], and gx + y) = g(x)g(y), for x, y = 0, 
with x + y € J andx,y < 0 with —1 < x + y < 0. Consider the case where x 
and y are of opposite signs; that is, say x <Oand y > 0. Then either x + y > Oor 
x + y <0. In the former case, 


gy)=gaty—-x)=g@t+y)-g(-x). 


In the latter case, g(x) = gix+y—y)=g(x+ y)g(—y). Thus g satisfies (1.115) 
on [—1, 1]. Now we extend to R. 


For t € R, define 
t n 
co-[Q 
n 


where 7 € [—1, 1], for some integer n. This definition is independent of n. 
Note first that g(nx) = [g(x)]" for x, nx € [—1, 1]. Suppose £ € [—1, 1]. Then 


m 
t t 
Ne an and m - as [—1, 1] and 


eG) = [eC am)] = [Ge =). 


Let t, u € R. Choose n sufficiently large that £, 4, 4“ € [—1, 1]. Then 


n’?n? n 


E(t-+u) = E (: - “yy - E (<)] [s(=)) =£0- ew, 


This proves the theorem. 


1.7.6 Multiplicative Extension 
Suppose / satisfies the functional equation 


(1.116) h(pq) =h(p)h(q)_ for p,q € 0, 1). 


If h(1) = 0, then h(p) = 0 for all p (take g = 1). So, let us consider h to be not 
identically zero. If h(po) = 0 for po € JO, 1[, then h(tpo) = 0 for t € J0, 1] and 
h(q) = 0 for g > po (choose n sufficiently large that g” < tpo). So, we can assume 
that 4 is nowhere zero and consider / :]0, 1] > R%* satisfying (1.116). 


Result 1.87. A function f : T > R*, where T = 0, 1] or [1, cv[ satisfying f (ts) = 
ff (s), for s,t € T, has an extension M : R*, > R* satisfying (M). If f is 
strictly monotonic, then f (t) = t© [43]. 

Observe that T (-) is a semigroup and R*. = TT~—', and apply Theorem 1.67. 
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Theorem 1.88. Suppose h : [1, cof > R* satisfies 
(1.117) h(tn) =h(t)h(n)_ fort € [1, oof, n =1,2,.... 


Then h (1%) = hA(ty)h (4) holds for t, € [1,0o[. Further, if h is continuous, 
then there exists an M : Ri, > R* satisfying (M) such that M|[1,co[ = f and 
M(t) = f(t) =¢°, fort € [1, oof. 

Proof. Let 7 € ]1, oo[. Then m > n. Choose t € [1, oo[ with t > n. Then h(t) = 


Om ane h(n) using (1.117), so that h (£) = #4, n(2) = Hm. 
OW 


h (1. =) = (<-m) =: (=) rom (by (1.117)) 
n n n 


h(m) 

= h(t). —~* =A(t)h (= iF 
h(n) 

As in Theorem 1.82, the identity above is valid for all ¢ € [1, o[. 

If h is continuous, clearly h(st) = h(s)h(t) for s,t € [1, co[. The rest follows 

from Theorem 1.82, Theorem 1.67, and Theorem 1.49. 


1.7.7 Extension of Derivations 


Result 1.89. [558]. Let P be an integral domain, F be its field of quotients, and K 
be a field extension of F. If f : P — K is a derivation satisfying (D), then there 
exists a unique derivation d : F — K such that d|P = f. 


Result 1.90. [558]. Let K be afield of characteristic zero, F be a subfield of K, B be 
an algebraic base of K over F, if it exists, and otherwise let B = ¢. Let f : F > K 
be a derivation. Then, for every functionh : B — K, there exists a unique derivation 
d:K — K such thatd|F = f, d|B=h. 


This result proves the existence of nontrivial derivations of R [558]. 


1.7.8 Almost Everywhere Extension 


P. Erdés in 1960 [260, 558] raised the following question. Suppose a function f : 
R — R satisfies (A) for almost all (x, y) € IR? (in the sense of Lebesgue measure); 
that is, f(x+y) = f(x)+/() is valid for all (x, y) € D with the Lebesgue measure 
of R?\D zero. Does there exist an additive A : R > R satisfying (A) forall x, y € R 
such that A = f a.e.; that is, for all x € D,, where the Lebesgue measure of R\ D, 
is zero? The answer turns out to be positive (see [136, 406]). Earlier Hartman [345] 
had proved that if f(x + y) = f(x) + f(y) holds for (x, y) € C x C, where the 
Lebesgue measure of IR\C is zero, then there is an additive A : R > R with A = f 
a.e. on R. 

This statement fails to hold for arbitrary measures [562]. Further, that the additive 
A need not be an extension of f was shown by C.T. Ng and J. Aczél based on the 
following a counterexample by V. Zirak Walsh (see [55, 44, 45]). 
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Take D = {(x, y): x, y,x+y 4 3, 6}U(3, 6). Obviously, the Lebesgue measure 
of R?\D is zero and D, U Dy U Do = R. Define 


j=) 


, forx € R\{3, 6}, 
f(x) = 43, forx =3, 
, forx =6. 


fon 


Clearly f is additive on D, but f is not additive on R. 

If a function f : R — C satisfies (E) a.e. on R?, does there exist an exponential 
E :R — C such that E = f a.e. on R? The answer turns out to be positive again 
(see [44]). 


1.8 Applications 


We now present some applications based on the functional equations studied so far. 
We will return to more applications later, in Chapter 17. We start off with two exam- 
ples from economics given by W. Eichhorn [250]. 


1.8.1 Economics 


Example 1. Price and Advertising Policy 


Consider a firm making a single product. It is natural to assume that the sales S 
depend on the price p of its product and on the advertising expenditure 4 (a simple 
model); that is, S: Ry x Ri > R, S = S(p, A). What properties has one to impose 
on S$ to determine S? It is normal to expect the sales to go down with an increase 
in price and go up with an increase in advertising expenditure. Keeping these in 
mind, the following postulates were given by Eichhorn [250]. Let the sales function 
S satisfy 


(Cy) S(p+q,4) = w(q,4)S(p,4), forp,qgée Ri, 4€ Ri, 


where y : Ry x Ri > R with y(0,w) = 1 for w > 0, S(O,1) > 0, and 
y(-, A) is a (nonconstant) decreasing function in the first variable (for fixed ); that 
is, p > w(p, 4) is a nonconstant decreasing function. 

(C,) says that the sales at price p + q (for an increment q in price) equal the 
previous sales times a real number < | that is a nonincreasing function of q; that 
is, for fixed 2, S(p, 4) considered as a function of p is convex and decreasing. The 
assumption S(O, 1) > 0 implies there are sales if the price is zero and the advertising 
expenditure is 1. 


(C2) S(p, Au) = S(p,4)+T(p, hn), forpe Ry, 4,4 € Ri, 


where T : Ry x Ri. > Rwith T(p, 1) = Oforall p > Oand T(p, -) is an increasing 
(nonconstant) function of the second variable (for fixed p); that is, 4 > T(p,2)isa 
nonconstant increasing function. 
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(C2) says that a multiplicative change in advertising expenditure results in an 
additive change in sales; that is, for a given p, S considered as a function of / is 
concave and increasing. For instance, sales increase with an increase in advertising 
expenditure. 

Now we determine S subject to the conditions (C;) and (C2). For fixed 2, (C1) 
is a Pexider equation (PE) of exponential type. Hence 


S(p, 2) = a E(p, ‘), w(p, ‘) = E(p, -), 


where E(p, -) satisfies (E) on R and aj is a constant depending on 1. Since y(p, -) 
is monotonically decreasing in p, so is E(p, -) and E(p, A) = e?!?, Thus 


(Ci) S(p, A) = ar e"™™?, w(p, a) =e! MP, ai(A) = SO, 2). 


For fixed p, (C2) is a Pexider equation (PL) of logarithmic type. Thus, there 
exists an L satisfying (L) on R% such that 


S(,4)=LQ,-)+b2, T(,4)=L(,:). 
Since T is monotonically increasing in A, L(A, p) = a2(p) log A, so that 
(C22) S(p, 4) = a2(p) log 4 + ba(p), T(p, 4) = a2(p) log 4, 
with a2(p) > 0, b2(p) = S(p, 1). 
In order to determine S, either a;(4) and b,(A) or a2(p) and b2(p) are to be 


obtained. We seek a2(/) and b2(A). 
Substitution of S from (C1) in (C2) gives 


ay(Ap) «eM? = ay(A)eM? + an(p) log 
=a,(uje!? + ax(p)logA (interchanging / and 12). 


With p = 0, we obtain aj (A) — a2(0) log 2 = ai(“) — a2(0) log w so that ay(A) = 
alogda + b (by fixing yw), where a and b are constants and 


S(p, 4) = (aloga + bye? , 
with b > 0, since S(O, 1) > O anda = a2(0) > 0, b = b2(0) > 0 (put p = 0 in the 
equation above and (C2)). 
Substitution of S from (C22) in (C1) gives 
ar(p + q) log 2. + ba(p + 4) = [a2(p) - log 4 + ba(py]e 4. 
With p = 0, we have 
(1.118) ar(q) log 4 + ba(q) = (alogd + bye, 


which by setting 4 = 1 gives b2(q) = be?!“7, Putting 2 = 2 and 4 separately in 
(1.118) and eliminating a2(/) from these equations, we get 


70 1 Basic Equations: Cauchy and Pexider Equations 
b b 
(alog2 + bye?!@4 — | alog2 + — } ce! O9 — e219 — 9, 
2 2 


Suppose b1(1), b1(2), and b;(4) are distinct. Then {e?'1)?, e61@P, ei?) is lin- 
early independent and b = 0 = a, contradicting a > 0 and b > O. If two of 
bi(1), 51 (2), and b;(4) are equal, since a,b > 0, all b}(1), bi (2), and b; (4) are 
equal. 

Now 4 = 2 in (1.118) gives a2(q) = ae “4, where c = —b,(1), and this a2(q) 
in (C22) yields 


(C12) S(p, 4) = (aloga + b)e~”. 


Since y is decreasing, from (C;) it follows that c > 0, the sales function S is given 
by (C12), and the arbitrary constants a,b,c are positive. Thus we have proved the 
following. 


Theorem 1.91. (Eichhorn [250]). The sales functions S : Ry x Ri. > R satisfying 
(C1) and (C2) are of the form (C12). 


Graphs of (C12) for p variation and 4 variation are given by 


S(p, 4) S(p, A) 


0) Pp 1 A 
p— (a+blogdaje"? A— (a+ blog dye"? 
indicating a decrease in sales with an increase in price p and an increase in sales with 


an increase in advertisement expenditure 2. Note that basically we are using (E) and 
(L) equations. 


Remark 1.92. (Eichhorn [250]). S given by (C12) satisfies (C;) and (C2). For this 
sales function S, there is a maximum profit function P given by 


where the cost function K : Ry — R%_ satisfies the Pexider equation 
K(x+y)=K(x)+ NO), forx, y € Ri, 


with N not constant and N : Ry > Ry. 
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The cost function K (x) = a; + b,x with a; > 0, b; > 0, and the profit function 
P is given by 


P(p,4) = (p—bh)(aloga + bye“? — ay — 2. 


The maximum profit occurs at (p*, 1*) = (1 + by, te-!-eb) : 


Example 2. Production, Technical Progress 


A production function describes how the amount of production depends on several 
factors, such as capital, labour, raw materials, etc. We consider the production func- 
tion (given in Eichhorn and Kolm [251], and Eichhorn [250]) ® depending on capital 
k, labour @, and time t, which is also known as the index of the state of technology; 
that is, D: RS — R,4.A basic assumption on @ is that there is at least one vector 
(k*, €*), k*, €* > O such that 


(p1) t—> O(k", £*,t) 


is a nonconstant increasing function for t > 0. This says that the output O(k*, €*, r) 
for a fixed input vector (k*, €*) increases with time; that is, with technical progress. 


Output augmenting. Technical progress is output augmenting if there exist ®; : 
Ry —> Ri, a, : Ry — Ry, such that 


(p2) O(k, ¢, t) = a1@)O1{k, €). 


It is clear that a is a nonconstant, increasing function for t > O (use (pj)). 


Labor augmenting. Technical progress is said to be labor augmenting provided 
(p3) O(k, £,t) = O2(k, a2(4)€) 


for some non-decreasing functions ®2 : Ry —> Ri, ag: Ry Ry. 


Capital augmenting. Technical progress is capital augmenting provided 
(pa) (Kk, €,t) = O3(a3(“)k, &) 


for some non-decreasing functions ®3 : Ry —> Ri, a3: Ry Ry. 

Suppose © : RS — R, possesses the properties (p;) to (pa) with O2, ®3 nonde- 
creasing and a2, a3 are continuous, strictly increasing functions from a2(0) = 0 = 
a3(0) to infinity as tf — oo; that is, the production function ® is simultaneously 
output, labor, and capital augmenting. Then 


(1.119) @1(k, €) = clk?, O(k, ft) =alt)k?e“, 


where c, C1, C2 are positive constants. 
Thus ®, is the Cobb-Douglas [167, 250] function and ® is the Cobb-Douglas 
function with respect to k and @. 
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By hypothesis, 
(1.120) O2(k, a2(t)€) = a1 ()O1(k, €) = O3(a3(t)k, €). 


Since @ is a strictly increasing, continuous, onto function, for s > 0, there is at 
such that a2(t) = s. Replacing t by a (s) in (1.120) (the first part of the equality), 
we have 
Oo(k, s€) = m(s) 1 (k, €), 
where m(s) = a1 (a. '(s)) is a nonconstant monotonically increasing function. 
Now, for s,t > 0, we get 


O2(k, ts€) = m(st)® 1 (k, €) 
= m(t) D1 (k, s€) = m(s)®_(k, té). 
Setting s = 1| in the last equality gives m(1)®,(k, t€) = m(t)®1(k, €) andm(1) 40 


since otherwise 0;(k, €) = 0 = ®2(k, C), contradicting ®2 being a nondecreasing 
function. Hence 


Noi = aoe anG oe” Hee, 
m(1) 
Hence m(st) = m(s) 20. and —7_ is multiplicative (satisfies (M) on R) and 


; : /m(1) m(1) : 
monotonically increasing, so m(t) = bit“! with cy > 0 and 


O7(k, t€) = byt 1 (k, £) = dC" Dy (k, t); 


that is, 
@,(k, €) = €" O,(k, 1). 


Similarly, by considering the second part of (1.120), we get 
@1(k, 0) =k? ®, (1,6), co > 0. 
Thus, ®1(k, €) = €"@,(k, 1) = €CK2 0,1, 1) = c€'k®. This proves (1.119). 


1.8.2 Area 


Example 3. Area of a Trapezoid 


The area of a trapezoid is a function of the parallel sides and the height, say 
f(b, b’,h), and f is anonnegative function from RS — R, to be determined. 
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The desired properties of f are 


(ti) f+ bi, b+ bi, h) = fo, bh) + fbi, 5), h), 
(tz) f(b, b, hy +hz) = f(b, b,h1) + fb, h2), 
(t3) f(b, b',h) = f(b’, b,h). 


From (t;) we see that, for fixedh, f satisfies (A) on Ry and, by Theorem 1.24, 
fb, b; ) = Aid, ‘) Se A2(b', ‘), 


where A, and Az are additive in the first variables. Since f is nonnegative (bounded 
below), A,(b, h) = a1 (h)b and A2(b’, h) = a2(h)b’, so that 


f(b, b’, h) = ay (h)b + az(hyb’. 


To determine a, and a2, by the symmetry property of (t3), a1(1) = a2(h) = a(h) 
(say). Substitution of this f in (tz) yields 


ath, +h2)2b = a(h,)2b + a(h2)2b; 
that is, a satisfies (A) on R, and is nonnegative. So, a(h) = ch and 
f(b,b’,h) =c(b+b*jh, 


where c is a positive constant depending on the unit of area (refer to [144]). 


Example 3a. Area of a Rectangle 


The area F(x, y) of a rectangle with sides x and y satisfies the equations 


Dv 
ee ed ee 
KG Xe u Xx 
F(x, y) F(x +u, y) F(x,y +0) 


F(x +u,y)=F(x,y)+Ftu,y), 
F(x,y+v) = F(x, y)+ FQ, pd), x,y,u,0 ER}. 


Since F is additive in the first and second variables and F > 0 and F(x, y) = cxy 
with F(1, 1) = 1, the area is F(x, y) = xy (see the interest formula in Chapter 17). 
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1.8.3 Allocation Problem: Characterization of Weighted Arithmetic Means 


Example 4. Allocation Problem: Characterization of Weighted 
Arithmetic Means 


A group decision-making or allocation problem treated by Aczél, Kannappan, Ng, 
and Wagner and others [63, 64, 53, 16, 825, 458] is the following. A fixed amount s 
of quantifiable goods (money or other resources, such as energy, raw materials, etc.) 
is to be allocated or distributed (completely) to a fixed number m (> 2) of competing 
projects or applications. A committee of n assessors, advisors, or decision makers 
is formed for this purpose. Each member of the committee recommends how the 
funds should be divided or allotted. The jth assessor recommends that the amount 
xi; be allocated to the ith project in order to establish the “consensus allocation” 
(recommendation of the chairman) fj(x;). The jf; may, a priori, all be different. 
Of course, for the chairman and each advisor, the assignment to each project should 
sum to s. This can be presented in the table 


Project 
Assessor 


Chair — 


Aggregate fies) f2(x2) sore fii) poe fin(%m) s 
Allocation 


where xj = (xj1,%j2,---,Xin) (ith column vector). The problem is to determine 
fi subject to certain constraints. In general, the solutions are weighted arithmetic 
means. We now prove the following result from [53], which also shows that the 
conditions 

min{y;} < fOQ1, y2,---5¥n) < max{y;} 


and 


k k 
2 = s(7 = 1s.) Pf Gitss xy) =s 
i i=1 
(s and k > 2 are fixed) for f : [0,s]” — R characterize the weighted arithmetic 
means. Denote s = (s,5,...,5), 0=(0,0,...,0), 1=(,1,...,1). 
Suppose fj : [0,s]” > [0,5] @ = 1,2,...,m) satisfy 


(i) f;(@) = 0 (consensus of rejection; that is, if all referees assign amount 0 to the 
ith project, then the aggregated allocation should also allocate amount 0), 
(ii) fi(x) = O (aggregated allocation is nonnegative) (for x € [0,r]”, 0<r<s), 


and 
m 


(iii) aH =s=> > fils) =s. 
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n 
Then f/f] = f2 =--- = fm and there exist weights w; > 0 such that >> w; = 1 and 
j=l 


n 
Fily) = iQ, y25---5¥n) =D wjyj/G = 1,2,...,m) 
j=l 
for y € [0,s]”. 


Setting x; = 0 for (@ 
Similarly, fi(s) = s fori = 


rewrite (iii) as 
m m 
DAGI+A (:- >) =%, 
i=2 i=2 


and then substitute x3 = 0 =--- =X, x2 = x to get 


2,3,...,m) in (ili) and using (i), we get fi(s) = s. 
1,2,...,m (consensus of overwhelming merit). First 


fis—x)+ fox) =s 


and then x2 =x, x3 =y, xj =0 (i =4,...,m) to obtain 


fiis-—x—-y)t+ f)+ AO) =s: 
that is, 
fa@w +y)= fox) + fa) forx,y,x+y €[0,s]”. 


Finally, we obtain 


f@+y) = fa)+ fp) forx,y,x+y €[0,s]”. 


(Set x = O in the above to have f2(y) = f3(y). This also shows that all f; @ = 
1,2,...,m) are equal.) Now f3 is additive on [0,s5]” and is nonnegative. By the 
extension result, 


nl 
AO) = >) wjyj, wj 20, for y € [0,s]”. 
j=l 


n 
Since f3(s)=s, ) wj = 1. 
j=l 
This proves the result. 


Linear Programming 


The result above extends to constraints consisting of a system of linear equations 
and inequalities. This may be of interest for multiobjective linear programming as 
an alternative to Pareto optimization. We now present the result of Rad6 and Baker 
[676]. 
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Consider a linear programming problem with constraints of the form 


Pp 

(Lda e,et, Galen) apt > agap ea CH 12.,m=—p), 
j=l 

where J}, ..., /m are given proper open intervals, 2 < p < m, d,...,@m—p are 

given reals, and [a@;;] is an (m — p) x p real matrix having in each column at least 

one and in each row at least two nonzero elements. 

The problem is the following. Suppose that n individuals select solutions of 
(1.121), each in accordance with his own criterion. Find a consensual solution of 
(1.121) such that the consensual value of each variable depends only on the values 
assigned by individuals to that variable, the criteria being arbitrary. In other words, 
determine m functions (said to form an aggregation method with respect to the sys- 
tem (1.121)) 

da >IT, (G=1,2...,m) 
in such a way that 
P 
(1.122) zj el, Zp t > @ijzj=ai G=1,...,m—p) 
j=l 

implies 

Pp 

fp+i@p+i) + > ri) =a, (=1,...,m—p). 

j=l 
Let Cj (j = 1,...,m) and C be the sets of those z; € IR” or (z1,...,Zp) € (R")?, 
respectively, that can be completed as a solution of (1.122). Note that C is an open 
convex set and C; are n-dimensional intervals. 


Suppose C # - and f; is bounded above or below on some nonempty open 
subsets of I; (j =1,..., p). Then f; are given by 


fj@) = > wexe + Bj G=1,...,m), 


k=1 


where wx and #; are real constants subject to 


p n 
Bri +>) aij8; — (: - ay G=1,...,m—p). 
k=1 


j=l 
1.8.4 Sum of Powers of First n Natural Numbers and Sum of Powers on 
Arithmetic Progressions 
Sums of Powers of Integers 
There are many ways of finding the sum 


Sen) = 1 +2" 4..-+n*, keEZi, ne Z*, 
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that were developed by several fascinated mathematicians. The methods adopted 
were varied in nature—evaluation atn = 1, symmetry argument, modified Bernoulli 
number identity, generating functions, etc. Here the focus is going to be on the use of 
functional equations, especially the additive Cauchy equation (A) on Z*., to obtain 
the sum. We will illustrate this in the cases k = 3 and 4; that is, we will find S3(n) 
and S4(n) assuming the sums S;() and S2(n) (these could also be obtained by the 
method to be presented). The general references ([16, 21, 122, 142, 517, 543, 607, 
664, 758, 759, 836]) are only a partial list. 


The sums S;(7) satisfy the system of functional equations 


k 
(#) Sem +n) = Sm) + kn) +> (j )n'si-som. keZy4, mne Zt 


i 
i=l 
(use binomial expansions in S,;(m + n) after the nth term). Note that So(n) = 
n, Sx(1) = 1, and we are assuming S;(n) = Meh) So(n) = SESE aaee 
For k = 3, («) yields 
S3(m +n) = S3(m) + $3(n) + 3nSp(m) + 3n7S1(m) +n? So(m); 


that is, 


4$3(m +n) = 4$3(m) + 4$3(n) + {4m3n + 6m2n? + 4mn>} 
+ {6m7n + 6mn7} + 2mn 
= 453(m) + 483(n) + {(m +n)* — m4 — 4} 
+ 2{(m +n)? — m? — n°} + {Qn +n)? — m? — n°}, 


which can be put in the form 
Ai(m+n)=Ai(m)+ Ai(n) form,n € Z%, 
which is the additive Cauchy equation, where 
A(n) = 4$3(n) — n4 — 2n? =n’. 
Evidently, A; (7) = c3n, so that 
4$3(n) =n* +2n?+n?+c3n, ne tsa 


: = . _ _ (nt)? 
Setting n = 1, we obtain cz = 0 and $3(n) = —{—. 
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Now we will find S4(n). For k = 4, (*) gives 


Sa(m +n) = Sa(m) + Sa(n) + 4n$3(m) + 6n*S2(m) 
+4n? Sy (m) + n*So(m); 


that is, 


5S4(m +n) = 5S4(m) + 5S4(n) + {5m4n + 10mPn? + 10m?2n3 + 5mn*} 
+ {10m3n + 15m?n? + 10mn} + {5m2n + 5mn?} 
= 5S84(m) + 5S4(n) + {(m +n) — m> — n?} 


5 5 
+ 5{(m +n) —m* nh + (mtn)? —m* =n}, 
which goes over to 
Ao(m+n) = Ao(m)+ Ao(n) form,n € Z%, 


where 
Ss) 5 
Ax(n) = 5S4(n) — n? — =n* — =n?. 
2 3 
Thus 5$4(n) = n+ 3n4 + 3n +cqn forn € Zi. As before, = | gives c4 = — 
and S4(n) = yq[6n> + 15n4* + 10n3 — nn] = Hn + 1)Qn + 1)Bn? + 3n— 1). 


uy 
6 


Remark. The general solutions s;, : N > R (k = 0,1,2,...) of (*) are given by 


[16], 
t Soyeed 
Sy(n) = —— ae, Kee AF.) 
k+1 £ 
€=0 
where co, C1, C2,... are arbitrary constants in R. 


The constants c; satisfy 


k-1 


k 
(Jak (KS 1,2,3,.<.). 


We get 


co= 1 So(n) =n 
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co + 2c; =2 cj = 


i 
Si(n) = 5 (con® + 2cin) 


1 2 
=o +n) 
_nin+1) 
= 5 ; 
co + 3c) +3c2 = 3 o= 


1 
Sy(n) = 3 (con + 3cin? + 3c2n) 


1 1 1 
=> gn + 5n + Fa 


= nin ty, 


co + 4c; + 6c2 + 4c3 = 4 c3 = 0 S3(n) = [con* + 4cyn? + 6c2n?] 


i t” te 
2d Bn(x) nt = Za 
where B, = B,(1). 
Now we will turn our attention to the sum of powers on arithmetic progressions: 


Si(n; a, h) = ak +(ath)'+.--+fat(n—-Dal, fork e Z4,n€Z%, a,heR. 


By adopting the method used above, we will determine S2(n; a, h) by assuming 
Si(a;a,h) = Fn? + (a _ f) n and noting that So(n;a,h) =n, Sx(1;a,h) =a". 
First of all, S;,’s satisfy the system of functional equations 


k 
k , 
(ax) Se(m +n; a,h) = Sx(m; a,h) + Sx(n; a, h) + > (7) com’ sia a,h) 
L 
i=l 


fork € Z,, m,n € Zi, a,h € R, which fork = 2 results in 


So(m +n; a,h) = So(m;a,h) + So(n; a, h) + 2nhS\(m; a,h) + (nh)? So(m; a, h) 
= $)(m; a,h) + Sx(n; a, h) + h?{m?n + mn?} 


= MY .2 
a->s -2mn 


h2 
= S2(m;.a,h) + So(n;a,h)+ = l(m +nyp =n =n} 


+h (« - >) {(m + ny —m* —n"}, 
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from which we obtain 


A(m+n) = A(m)+A(n) form,n € Z*, 


where 


h? / 
A(n) = S2(n; a,h) = h (« 5)" 


Hence, $2(n; a,h) = wn +h (a _ 4) n? + cn. Using S2(1; a, h) = a”, we have 
c= qa? _h (a _ 4) and $2(n;a,h) = Wni+h (a _ a) n+ fa? —ah+ eI n 
(see [21, 758]). 

Sx (n) can also be given by 


where B,, is the Bernoulli number defined by the generating functions. 


Remark. The general solutions s, : N > R (k = 0,1, 2,...) satisfying (*«) are 
given by [21] 


k 

1 k+1\ . : 

sg (n; a, hh) = —— : s ni leg hd (néZ*), 
See ar 1 


where co, cj ... are arbitrary constants in R with 


k k 
> ( ow shi =ka‘-! (k=1,2,...) 
ja 

2 


co=1, 2cy+coh =2a, 3c2+3cyh + coh? = 3a’; 


that is, 


1 
co=1l, 2c =2a-h, 3c2= 3a> — 3ah + she. 


1 2 h » h 
si(n;a,h)= 5 Gnei +n°coh) = rie + fa- 5 n, 


1 
s2(n;a,h) = 3g Gne2 + 3n>cjh + n>coh’) 


h3 in) h ae hate 
= —7 nla——]n a~ —an SN cs 
3 2 6 
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1.8.5 More Sums Using the Additive Cauchy Equation 


Not only are we interested in determining the sum of (i) powers of integers and 
(ii) powers on arithmetic progression, but also other summations of series. Here 
we give an example of finding the sum of the product of integers using the Cauchy 
equation, namely 


t(n) =1-2-342-3-44---+n(n+1I(4+2), ne Zi. 
As before, first we start with 
t(m+n)=t(n)+(14+1)24+2)(1 +3) 
+---+(n+m)(n+m-+1)\(n+m +42) 
=t(n)+mn? +n?64+9+4---+3(m+1)) 
+n {IL +264+--+-+ Gm? + 6m +2)} + ¢(m) 
|| 
2 


1)Q2 1 1 
m(m + x m+ 46. mnt ” + 2m 


= Hn) +1(m) + mn’ + 30? | 
+n[3 
3 3 3 22 
=t(n)+t(m)+ 4mn> +m a amen 
Ee 9 4 a2 2 en 
=m*n + =mn —mn 
2 2 2 
1 
=1(n) + 1(m) + 21m +n)! = mt =n") 


3 11 
+ alm + n)> =m =n") + aim + n)* =m =|, 
which results in the additive Cauchy equation 


A(m+n)=A(m)+A(n), m,neé Z5, 


where ; : rT 
A(n) = t(n) — 7” - ial - re 
Thus i . 4 
t(n) = qn + sm + =" +cn. 
With n = 1, we get 
be 1 i 3 ‘. 11 ae. 
a aie ee 


that is, c = 3, so that 
1 3 11 3 
t(n) => zy + 3” + = + 7" 


=Fin+ Dt 2043). 
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1.8.6 Application in Combinatorics and Genetics 


We are all familiar with the combinatorial formula ("") = AO = the 
number of ways of selecting r things at a time out of n things. We will find G i 
by using the additive Cauchy equation (A). Define f;(7) = the number of possible 
ways of using additive Cauchy equation (A). Define f;.(m) = the number of possible 
ways of picking r things from n things. 

In genetics, it is of interest to know the combinatorial function g;(”) = the num- 
ber of ways to pick r things out of 1 things, this time permitting repetitions. We will 
determine /3(n) and g3(n). 

Note that /3(3) = 1, g3(1) = 1 (there is only one way one can take one thing 
thrice), fij(n) = n = gi(n). Let us assume fo(n) = — n=) and go(n) = antl) ; 
which can also be obtained by the method presented below (see Snow [759]). 

First of all, it is easy to check that f,.(m) and g;(n) satisfy the system of function 
equations 


r-1 


(1.123) t-(m +n) = t,(m) + t-(n) + >) ti(n)t,—i(m) 
i=l 


form,n,k € Z%. 
To find f3(1), take r = 3, t3 = f3 in (1.123) to get 


fam +n) = fa(m) + far) + fi) fam) + fot) film) 
m(m =) se att! —1) 
= f3(m) + fan) +n——— a — 
= f3(m) + fa(n) + smn + mn’) —mn 
= falrm) + fala) + 2llom bm) m3 — 3] — SH m-tn)?—m? =n? 
which can be rewritten as 


(A) Ai(n+m)=Aji(n)+Ai(m) form,ne Z%, 


where Aj(n) = f3(n) — gn? + 5n? 
Since Ai(n) = Cin, 


1 1 
Ba) = = oe +cn 


6 3 
1 1 1 
el = 5” +n (use f3(3) = 1) 


_ntn—I(n—2) _ (n 
= Ho=DO=D (1) 
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In order to determine g3(n), take r = 3, f3 = g3 in (1.123) to obtain 


g3(m +n) = g3(m) + g3(n) + gi(m)g2(m) + g2(n)gi(m) 
m(m + 1) n(n + 1) 
—— A) 


= g3(m) + g3(n) +n- : 
= g3(m) + g3(n) + lim +n) — m3 —n7] 


2 2 
—n), 


1 
+ slim +n)? —m 
which goes over to 
(A) A2(m +n) = Ar(m) + Az(n), m,n e Z%, 


where A2(n) = g3(n) — en? = zn. 


As before, since A2(n) = c2n, 
(n) = Lg, | ag 
83 a Tae +con 
1 1 1 
= an + 5h + a (use g3(1) = 1) 


1 
= gnu + 1)(n+2). 
Remark. Even though the system is the same for both f,(”) and g,(), the solutions 


differ because of different boundary conditions. The beauty is that the same method 
works (refer to Aczél [16, 21], Snow [758, 759]). 
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Matrix Equations 


This chapter is devoted to functional equations whose domain, domain and range, or 
range are matrices. We first consider the matrix version of the Cauchy equations (A), 
(E), (L), and (M). Then we treat the matrix version of cosine equation (C) and other 
generalizations. 

Let M,(F) (n => 1) denote matrices of order n (n x n matrices) over a field F 
(could be R or C). Indeed, M,(R) = R”, M,(C) = cr”, and GL(n, F) is the set 
of all nonsingular (regular) matrices of order n over a field F. 

For P € M,(R), 


det(P — AI) = (—A)"In + (—A)" "Si (P) + A" 7 So(P) + +++ + Sn(P) 


is the well-known characteristic polynomial of P, where J, is the identity matrix of 
order , and det stands for the determinant. It is true that the characteristic polyno- 
mial is invariant (under conjugacy) with respect to nonsingular matrices; that is, 


det(C7!(P — AI,)C) = det(P — AIn), P,C € M,(R), 


for every nonsingular matrix C. This implies that the functions $;(P), S2(P),..., 
S,(P) are invariant under conjugacy, where S; : M,(R) > Rfork = 1,2,...,n. 

S\(P) is called the trace of P (tr P) and is the sum of all eigenvalues of P and 
satisfies the additive functional equation (A), 


Si(P +S) = S8,(P)+S,(S), for P,S €¢ M,(R). 


Sn(P) is the product of all eigenvalues of P and is the det P and satisfies the 
multiplicative equation (M), 


Sn(PS) = Sn(P)-S,(S), for P, S € M,(R). 
The function $2(P) satisfies the quadratic functional equation (Q), 


So(P + S) + S2(P — S) = 2$2(P) + 2$2(S), P,S € M,(R). 
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Some further results concerning the functions $;(P), S2(P), and S,(P) are con- 
sidered here. In particular, we treat the matrix-additive (mA), exponential (mE), 
multiplicative (mM), and logarithmic (mL) functional equations 


(mA) A(P + S) = A(P) + A(S), 
(mE) E(P + S) = E(P)- E(S), 
(mM) M(PS) = M(P)M(S), 
(mL) L(PS) = L(P) + L(S), 


where A, E,M,L: M,(F) > M;(F) forn,k > 1, P,S € M,(F). 


Definition. A mapping Q: M,(F) > My(F), where F = R or C, is said to be 
unitarily invariant provided 


Q(U*PU)=Q(P), PeéM,(F), 


holds for every unitary matrix U. 
Q. is said to be orthogonally invariant if 


Q(O'PO)=Q(P), for P € M,(R), 


holds for every orthogonal matrix O € M,,(R) and O’ is the transpose of O. 


Result 2.1. (Kurepa [575]). Let T, ®, and Y be unitarily invariant functions that 
are defined on M,(C) > M,(C). Let 


(2.1) T(P + S) = G[®(P)®(S), ¥(P) + ¥(S)] for P,S € M,(C), 


where G : Myn(C) > My(C). 


If we denote by a a matrix of order n with matrix elements 
(a1)ij = O01; GJ=1,2,...,n), 
and if we define f, 6, yw: C + Mx(C) by 


flu) = Flay), @U) = Pay) and yu) = Pa), 


then 
(2.2) fu +o) = Glew), yw) + v)I 
and 

(2.3) TRS fEP) 


for every pair of complex numbers u, v and for every matrix P. 


Now we obtain the unitarily invariant solutions of (mA) and (mE). 
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Theorem 2.2. [575]. Let T, be a unitarily invariant function and satisfy (mA) from 
M,(C) to Mm (C), 


(mA) Ty (P +S) =T1(P)+T1(S), 
for every pair of matrices P and S. Then 
Ty(P)= fir P), PeMn(C), 
for all P. If the function T is continuous, then 
Ti(P) = a1Re tr P + azIm tr P 


for every P, where a, and az € M,,(C) do not depend on P. 


Proof. If in Result 2.1 we put 0 = 0, WY = T, and G(0O,u) = u, then we get 
F\(P) = fi(tr P), where the function fj : C ~ M,,(C) satisfies the functional 
equation (mA) fi(u +0) = filu) + filo). 

The continuity of 7; and the fact that tr P is a continuous function of P imply 
that f|(w) is a continuous function. It is well known that a continuous function that 
satisfies the functional equation (mA) has the form 


fil) = ajReu + agimu 


or 
Ti(P) = a,Re tr P + azIm tr P 


for every P. 


Theorem 2.3. [575]. Let Ty be a unitarily invariant function from M,(C) to My (C) 
and let 


T)(P + S) = To(P)T2(S) 


hold true for all P and S in M,(C). 
Then 


T(P) = faltr P), 
where the function fz :C — Mm(C) satisfies the functional equation 
(mE) fu + v) = rw) fre) 


for all complex numbers u and v. 
If det T2(P) 4 0 for every matrix P and if T2(P) is a continuous function, then 


T)(P) = exp[ajRe tr P + iazIm trP]. 
Proof. If in Result 2.1 we put ¥ = 0, G(u, 0) = u, and ® = T, then we get 


T2(P) = fo(tr P), 
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where the function f2 satisfies the functional equation (mE). Since det 72(P) 4 0, 
we have det f2(u) 4 0 for any complex number u. The continuity of 7>(P) and the 
continuity of tr P imply that the function f> is a continuous function of u. From 
(mE), we find 


(2.4) fit +is) = frat) his) = falis) pW 


with tf = Re u and s = Im wu. Further, 


fot +s) = fat) fals) 

and 
fadit tis) = falit) fois) 
for all real numbers ¢ and s. Thus 
fa(t) = exp(ait) 
and 
fo(is) = exp(ia2-s) 
for all real numbers t and s. Thus, from (2.4), we have 
expayt - expid2s = expia2s - expayjt 
for all real numbers ¢ and s, where a; and az € M;(C). This implies that a; and a2 
commute. Thus 
fou) = fo(t tis) = exp(ait) - exp(iazs) = exp(ait + ia2s) 


or 
To(P) = exp(aiRe tr P + iazIm tr P) 
for every matrix P. 


Result 2.4. [575]. Let T, and T> be orthogonally invariant functions from M,(R) 
to M;(R). If T, satisfies (mA) and T> satisfies (mE) for all matrices P, S in M,(R), 
then 

Ti(P)= fitrP) and 1)(P)= foltr P), 


where f\ and f2 are functions of real variables to My (R) and satisfy the correspond- 
ing functional equations 


AG+tyY=fAiG)+fAdO) A@+y=h@hAO), forx,yeR. 
Now we consider the remaining two equations (mL) and (mM). 


Result 2.5. [575]. Let T, ®, and ¥ be unitarily invariant functions that are defined 
on M,(C) and have values in M;(C). Let 


(2.5) T(P -S) = G[®(P)®(S), ¥(P) + ¥(S)] 


hold true for all matrices P and S, where G - My (C) > M;(C). 
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Let a (u) denote a diagonal matrix with matrix elements (a\(u))jj = 1if j #1 
and [a,(u)]11 = u. If we set 


fu) =Tlai@)|, gu) =Plai@)], and ylu)=Vlai@)], forweR, 
then we get 
(2.6) f(uv) = GIg@)gdr), yu) + y(r)] 
for any couple of real numbers u, v, and 
T(P) = f (det P) 
for every matrix P. 


Theorem 2.6. [575]. Let T3 be a function that is defined on the set of nonsingular 
matrices of order n (M,(C)) and takes values in My(C). Let 


(2.7) T3(PS) = T3(P) + 73(S) (mL) 
for all nonsingular P, S € M,(C). Then 
T3(P) = f3(det P), 
where the function f3 :C — Mm(P) satisfies the functional equation 
(2.8) f(uv) = fatu) + fae) (u,v (40) EC), 


which is logarithmic. 
If T3(P) is a continuous function, then 


T3(P) = a log | det P| 
for every nonsingular matrix P, where the matrix a € My(C) does not depend on P. 
Proof. For P = S = Jy, (2.7) implies T3(/,) = 0. This and (2.7) lead to 

T3(S~! PS) = T3(P) 


for any nonsingular matrix S. Thus 7>(P) is a unitarily invariant function. In the 
same way as in Result 2.5(® = 0, YW = T, G(0,u) = u), we get 73(P) = 
f3(det P), where the function f3 satisfies the functional equation (2.8). We thus 
have that a continuous function /3 that satisfies (2.8) has the form 


(2.9) fu) = alogu 


for u > O (see Theorem 1.2 and Corollary 1.22). Suppose that u = exp(it) 
and v = exp(is), where tf and s are real numbers. Then the continuous function 
h(t) = f3(expir) has the properties 


h(t+s)=h(t)-h(s) and h(t+2z)=h(t) 
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for all real numbers ¢ and s. Thus 
(2.10) h(t) =0 
for all real numbers t. Now (2.8), (2.9), and (2.10) lead to 
flu) = (ul expi argu) = fs(\ul) + falexpiargu) = fs (ul); 


that is, 
f3(u) = a log |u| 


for every complex number u 4 0. Thus 73(P) = aln|det P| for any nonsingular 
matrix P. This concludes the proof. 


Result 2.7. [575]. Let T, be a unitarily invariant function defined on M,(C) that 
has values in M;(C). Let 


(2.11) Ty(PS) = Ty(P)-T4(S) (mM) 
for every pair of matrices P and S. Then 
Ts(P) = f4(det P), 
where the function f4 of a complex variable u satisfies the functional equation 
(2.12) fa(uv) = fa) far) 


for all complex numbers u and v. 
If the function T4 is continuous and if det T(P) 4 0 for every matrix P, then 


T4(P) = exp(a log | det P| + iN arg det P) 


for every nonsingular matrix P, where a and N are square matrices of order m. 
Matrix N is similar to the diagonal matrix that has integers on the main diagonal. 


Result 2.8. [575]. Let T3 and T4 be orthogonally invariant functions defined on 
M,,(R) with ranges in M;(R). 
If 
T3(PS) = T3(P) + 13(S) and T4(PS) = T,(P) - Ta(S) 


for every pair of matrices P and S in M;,(R), then 
T3(P) = fa(\det Pl) and T4(P) = fa(det P). 


Result 2.9. [575]. Let T be defined on M,(R) and have values in M;(R). Further, 
let T satisfy (Q). 


(2.13) TIPS) 47 (P = 8) a 27 (P) 427) 


holds true for every pair of matrices P and S. 
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If the function T is invariant with respect to a group of all nonsingular matrices 
(i.e., if 
T(S-'PS) =T(P) 


for every nonsingular real matrix S and if T is a continuous function), then 


T(P) =a S1,) +o 


j=l ial 


as ate = aS?(P) + bS2(P) 


for every matrix P where real numbers a and b do not depend on the matrix P.. 
The proof of this result is based on the following theorem (see Chapter 5). 
Let H be a real Hilbert space and let a : H — R satisfy that 


a(x + y) + a(x — y) = 2a(x) + 2a(y) 


holds true for x, y € H. If the functional a is continuous in one point or bounded on 
one sphere, then 

a(x) = (T'x, x) 
for every x € H, where the bounded and symmetric linear transformation T' does 
not depend on x. The transformation T' is uniquely determined by the functional a. 


2.1 Multiplicative Equation 


Now we determine the general solution of 
(mM) S(XY) = f(X)FW), X,Y € M(B), 


where f : M,(R) > M;(R), n,k > 1. 

The equation (mM) plays an important role in the theory of geometric objects 
and in the theory of invariants. This equation has been studied by many authors. For 
some values of n and k, it has been solved under strong assumptions of regularity of 
the function f (see, for example, Kucharzewski [544, 545, 546], Kuczma and Zajtz 
[564]). 

Without any regularity supposition about the function f, equation (mM) has been 
solved form = 2, k = 1 in Golab [322]. This result has been generalized to the case 
of k = | and an arbitrary n in Kuczma and Zajtz [565]; see also Aczél [12]. 

In the general case (m,n arbitrary), equation (mM) has been solved in Kurepa 
[575]. However, though the author makes no assumptions concerning the regularity 
of the function f, he imposes on f conditions of another kind (invariance under some 
operations) (see Theorems 2.2 and 2.6). 

The equation (mM) for k = n = 2 is treated in Kuczma [548] in Result 2.10. 


They find all solutions of the equation making no suppositions about the required 
function f and assume that relation (mM) holds only for nonsingular matrices X, Y. 


For n = 2, k = 1, the solution of (mM) without any additional condition is 
determined in Golab [322]. 
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Theorem 2.10. (Golab [322]). Suppose f : M2(IR) — R satisfies (mM). Then 
(2.14) f (X) = M(det X), 


where M : R —> R satisfies (M). 


Proof. Based on the proof in [322], we present the following proof. 

It is obvious that f = 0 and f = | are solutions of (mM). We look for nontrivial 
solutions. 

First the idea is to express f, a function of four variables, as a product of four 
functions of single variables a, 0, 8, @ : R — R, two of which satisfy (E) on R that 
are constants and the two others satisfy (M) on R. Define (introduce) 


ade) = f00,0,0,1)= (99). 
a(x) = Fl. 1.0.49= 4 (5 1). 
poy = 70,0.x.0=F()4), 


and 


oy = F040, =7(5 5). forx ER. 


a and @ satisfy (M), and £ and ¢ satisfy (E). Indeed, use (mM) to get 


acs = F793 9) = 409) (53) ] =e aon, 


pty) =f @ ie =f (! ') (; ') | = B(x) By), 


forx,y ER. 
Similarly, we can show that @ is multiplicative and ¢ is exponential. If aj; 4 0, 


then a 
a1 412 1 0 ai 0 no) ac 
X= = . . 7 au 
2) (a ) ( 0 ) (1 a vay 


where A denotes the determinant of the matrix X. Thus 


(2.0), 


a 


(2.15) f (a,b,c, 4) = ala)B (<) 


fora £0. 
Whenever M(a) = 0 for some a (4 0) € R, then M = 0 (for M(x) = 
M (Xa) = M (4) -M(a) = 0). Since f 4 0, @ and 6 are never zero and 
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a(1) = @(1). If E(a) = 0 for some a € R, then E = 0. As before, since f #0, £ 
and ¢@ are nonzero and £(0) = 1 = (0) (see also Chapter 1). First we show that 


p=1=¢. 


: al 11 : 
With X = [ ,) , i= ¢ 7 , (mM) and (2.15) yield 


a(a)p(1ya (=) ¢ (=) = a(a)p (=) 4 (=) sa pOAMt): 


that is, d(a) = lorg=l,ord (+) = 1 or f(x) = 1 forall x € R, except at 
2 
x = 1. But then, since ¢ satisfies (E), 611) = ¢ (3) = 1.Hence d= 1. 


Set X = (; ’) a (; 5) and use (mM), (2.15), and ¢ = | to obtain 


a(1)B(d)0(bd) = a(l)B(O)O(d) - aC BU)A), 
which gives £(d) = 1 (use @ is multiplicative, a(1) = 1), so that 


ad — be 


(2.15a) f(a,b,c,d) = a(a)o ( ) fora £0. 


Finally, we show that a = 0, so that f(a,b,c,d) = ¢ (<ct (: ‘)) with ¢ multi- 
plicative, which is (2.13). 
Let X = ( ) ry a C ) in (mM). Use (2.15a) to have 


a(b)0 (=) = a(1)0(d) -a(1)A(c): 


that is, a(b)¢ (;) = | (since @ is multiplicative, a(1) = 1). Thus, a(b) = @(b) for 
all b (4 0), and (2.15a) yields 


(2.14) f(a,b,c,d)=¢ (<ct (: )) : 


where ¢ satisfies (M). This proves the theorem. 


The casen = 2=k 


Result 2.11. (Kuczma [548]). Let f : M2(R) — M2(R) satisfy the functional 
equation (mM) for all nonsingular matrices X and Y. Then we have 


(2.16) F(X) =0 
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or 
(2.16a) f(X) = d(det X)-C ( 5) Cc 
or 
(2.17) f(X) = d(detX)-C-X-C7! 
or 
(2.17a) f(X) = G(X), 
where ¢ is given by 

60) = "0° moo l: 


where m ¥ Ois an arbitrary multiplicative function on R, G is an arbitrary (matrix- 
valued) function satisfying the equation (M) and the condition G(1) = In, and C is 
an arbitrary nonsingular matrix (and thus playing the role of a parameter in formu- 
las (2.16a) and (2.17)). Formulas (2.16) and (2.16a) give the singular solutions of 
(mM), while solutions (2.17) and (2.17a) are nonsingular. 


Case n arbitrary, k = 1 

Let f : M,(F) > F (F = Ror C) satisfy (mM). It was shown that f has the form 
(2.14). A simplified proof is given in Kucharzewski and Kuczma [547]. Here we 
present the proof in Hosszt [379] (also see Karenska [527]). 

It is a well-known theorem [299] that every matrix X has a (polar decomposition) 
factorization X = HU, where H is Hermitian and U is unitary, and hence both 
factors are equivalent to diagonal matrices. On the other hand, the value of f is the 
same for equivalent matrices, just like the value of the determinant, since 


fFCxe Ys fOROreVeSfOnC \fOSfCe ys 7. 


So X is a product of two function values depending on diagonal matrices, and 
hence it depends only on a diagonal matrix D having the same determinant as X 
since the determinant is also a multiplicative function. Therefore, considering the 
factorization 


d, 0: 
_ [0a 0 
2 Otay Oo 
0 QO: See sdn dn 
10. 10... 
x oe 0d, 0 010. 
~ 00 1 0 
00--- 0d, 
n dy, O 
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where P; consists of the elements of the unit matrix but the first and kth rows are 
permuted, we get 


n dy O +: 
f®=fD0)=[][F[ 01 0-- 
k=1 cae 


n 
Il & 0 
= k=1 
=fI 0 1 0 


= f (det D) = f(det X) 


for every X € M,,(F). 
The theorem proved above gives the possibility of axiomatizing determinants 
without coordinates by the multiplicativity and homogeneity: 


fUX) =A" F(X), 1eF, XM, (F). 


Result 2.12. [565]. Let f : R* — M3(R) satisfy (mM). Then f has one of the 


forms 


f@)=C 
0 0 M3(x) 

M(x) M(x)L(x) +O 
0 M(x) i Je, 
0 0 M3(x) 


f@)=C ( 
g(x) —o(x~) 0 
a(x) g(x) 0 Cc 
0 0 M(x) 
M(x) M(x)Li (x) M(x)L2(x) 
0 M(x) 0 Cc 
0 0 M(x) 


M(x) M(x)Li(x) 0 

0 M(x) o- 7c. 

0 M(x)Lo(x) M(x) 

M(x) M(x)Li (x) M(x)[La(x) + 5L7(@)] 

fa)=C{ 0 Mi) M(x)Li (x) ome 
0 0 M(x) 


f@)=C 


f@)=C 


f@)=C 


where M, M,, M2, M3 satisfy (M), L, L1, Lz satisfy (L), C is a nonsingular 
matrix, and g and o satisfy the system of functional equations 


g(xy) = gx)g(y)-o@)o(y), ohy)=o(x)gQ)+e@)o(y), x,y £0. 
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Case n = 1, k arbitrary 

See Kuczma and Zajtz [564] and Karenska [527] for the solutions f : R > 
GL(n, R) or GL(n, C) of (mM), where GL(n, R) (linear group) is the set of all 
nonsingular matrices of order 7 over R. 


Casen=2, k=3 
We now consider the functional equation 
(mM) M(XY) = M(X)- M(Y), 


where X and Y are nonsingular 2 x 2 real matrices (i.e.,.x, y € GL(2, R)) and M 
is an unknown function whose values are 3 x 3 real matrices. We do not make any 
assumptions concerning the regularity of the matrix function M. 


Result 2.13. A function M satisfying functional equation (mM) for every X,Y € 
GL(2, R) is given by the formulas 
X11 X12 0 
M(X) = C-m(det X) | x21 x22 0 Fae 
0 O mo(det X) 
a 2x 11X12 eon 
M(X) = C-m(det X) | xiixa1 x11%22 + ¥21K12 X12x22 | -Co!" 
x 2x21X22 co 
M(X) = G(det X) = —C - G*(detX)-C7!, 


where G* (det X) has one of the six forms 


m (det X) 0 0 
G* (det X) = 0 m2(det X) 0 ; 
0 0) m3 (det X) 


1 L(detX) 0 
G* (det X) = m(det X) | 0 1 0 
0 0 mo(det X) 


x (det X) —o (det X) 0) 
G* (det X) = | o(detX) y(det X) 0 
0 0 m3(det X) 


1 Li (det X) 507 (det X) + Lz (det X) 


G* (det X) = m(det X) | 0 1 L (det X) , 
0 0 1 
1 L(det X) L2(det X) 

G* (det X) = m(det X) | 0 1 0 ; 
0 0 1 


10 Li (det X) 
G* (det X) = m(det X) { 0 1 Lo(det X) J , 
00 1 
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M(X) =0, 
m(det X) 00 
M(X)=C.- 0 o00}-c", 

0 00 


m(det X) 0 0 X11 X12 0 
0 m(detX)0)- [x21 x22 0) -c7!, 
0) 0) 0) 0 00 


mi(detX) 0 O 


MOO=c: 


ee ee 


M(X)=C.- 0  mo(detX)0)-c7!, 
0 0 0 
m(det X) m(det X)L(det X) 0 
M(X)=C.- 0 m(detX) O]}-c7!, 
0 0 0 


x (det X) —o (det X) 0 
M(X)=C.-|o(detX) y(detX) 0)-c7!, 
0 0 O 


where m and mj (i = 0, 1,2, 3) are arbitrary multiplicative functions not vanishing 
identically (i.e., satisfying the equation (M)), L and L; (i = 1,2) are arbitrary 
functions satisfying the equation (L), and the functions x and o are a solution of the 
system of functional equations 


xen) = xO)X() — a C)o(n), 
a(n) = x)o(n) + x(a (6), 


fulfilling the condition 


on #0, 


X21 X22 


o(€)#0, X= Ge he 


The solutions of the function equation (mM) given by the first nine are nonsingular, 
and the rest are singular. 


Remark. The general nonsingular solution of the functional equation represents all 
homomorphisms M : GL(2,R) — GL(3,R). It means all real linear representa- 
tions 3 x 3 of the group GL(2, R). 


In McKiernan [630] (see also [631]), the following system of equations was con- 
sidered, where summation on integer n is assumed. 
Find all ai :G > F satisfying 


(2.18) a(x oy) =ai(x) +a,(x)a"(y) +440), 
together with 
(2.18a) a’, (x) =0 foralli < j, forallx,y eG, 


where G is a groupoid with 0 and F is a ring. 
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In view of (2.18a), summation on all n in (2.18) reduces to summation on j < 

a <1 for each fixed i, j. In the event that a’, depend only on the difference i — j, 
this problem reduces to (2.19) with gj; := a',, 

k-1 
(2.19) g(x + y) = glx) + >) ale) ge—a(y) + gx(y), 

o=) 
where gj : Ror C — F is connected with a characterization of “polynomials of 
binomial type” [690] to a composition law for Poisson distribution [49]. 


Result 2.14. (McKiernan [630]). Suppose G is a commutative groupoid with F a 
commutative and associative algebra over the rationals and F, the infinite matrices 
with elements in F. Then a necessary and sufficient condition that there exist a : 
G —> Fy satisfying 


a(xo y) =a(x) + a(x)a(y)+a(y) forallx,y €G, 
a(x) ~0 for 1—j <0and forallx €G, 


is that there exist L: G — Fo satisfying 


L(x) =0 fori — j <Oandallx €G, 
L(xoy)=L(x)+LQVy) forallx,y €G, 
L(x)L(y) = LOY)L(®) forallx,y €G, 


and given sucha, L, then 


CO 


1 
a(x) = » ley forallx €G. 
k=1 ~~" 


2.2 Cosine Matrix Equation 


Let G be an Abelian group, and consider the functional equations 


(2.20) fe ty) = Do i@kKGO) 
i=l 
and 
(2.21) fe t+y)+a@—y) = DAKO), 


i=1 


where n is a fixed positive integer and f, g, hj, k; are complex-valued functions 
on the group G. Equations of the form (2.20) and (2.21) have been treated by many 
authors (see Chapter 3). 
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Aczél raised the question, what is the general form of the solutions of (2.20) and 
(2.21)? In some cases, the general solution is well known even in the case of Abelian 
semigroups. For instance, (2.20) is a common generalization of the additive and 
multiplicative Pexider equations, and (2.21) is a generalization of the d’ Alembert 
equation. 

Concerning (2.20), in Kuczma and Zajtz [565] and McKiernan [630] it is 
observed that it is closely related to the matrix equation 


(mE) E(x +y) = E(@)E(y) 


and can be solved by the simultaneous diagonalization of the matrices E(x). Using 
this method, in [630, 631] is given the general solution of (2.20) under some simple 
conditions. 

In [787], the author extends the above-mentioned method for solving (2.21) by 
reducing it to the matrix equation (mE) and 


(C) C(x +y)+C@ — y) =2C(%)C(), 

and all solutions f, g of (2.21) are exponential polynomials and so are the functions 
h;, k; under some conditions on their linear independence. 

The Matrix Equation (C) 


We show now that, under the assumption C (0) = /;,, the components of any solution 
C of (C) are exponential polynomials. 


Theorem 2.15. (Székelyhidi [787]). Let G be a topological Abelian group, n a 
positive integer, and C : G + M(C") a continuous function for which the functional 
equation (C) holds for all x and y in G. If C(O) = In and C(x)? — In is regular for 
some xq in G, then there exists a continuous function E : G + M,(C") satisfying 
(mE) such that 


1 
C(x) = 5(E®) + Ex) 
holds for all x in G. (See [424] and Chapter 3, Theorem 3.21.) 


Proof. The proof follows the method in [424]. As C(O) = I,, we see from (C) that 
C is even. Then, by (C), 


1 1 
C()CO) = SICH +) + CG] — y= SICH +2) + CO —x)] = CONCH); 


that is, the matrices C(x) commute. On the other hand, it follows that C(2x) +7 = 
2C(x)? for all x in G. Denote the matrix C(xo) by P. It is well known [376] that 
there exists a regular matrix S$ that commutes, with every matrix commuting with 
P? — 1,, and S satisfies the equation S? = P* —],. But 


1 
P? -lJ= C (xo)? -h= 5 (C20) = In) 
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commutes with every value of C, and hence so does S. By (C) we have the following 
equations for x and y in G: 


2C(x + y)C(x — y) = C(2x)+ Cy), 
2[C (xo + x)C(y) + Co + IC) = Co +x +y)+Co+x — y) 
+C(xo+x+y)+C(xo+y— x) 
= 2C(xo +x + y) + 2C(x0)C(x — y) 
= 2[(CQ@o +x + y) + PA2C(X)C(Y) 
=F); 


and 
2C (xo + x)C(xo + y) = CQx9 + x+y) +C(x —- y) 
= [2C(x0)C (xo +x + y) — C(x + y)I 
+ 2C@)C(y) -— C+ y)] 
= 2[C(x)C(y) + CQ@o)C@o+x+y)-C&+t+y)I, 
and finally 
[C@+y)-C@—y)P = (CO +y) + CO — y)P -4C@ + ICA — y) 
= 4C(xY CQ)’ —4C(x)* — 4€0)? + 41 
= 4(C(x)’ —D(COY — D. 
Let 
E(x) = C(x) + S'[C(ao + x) — C(x) C@)] 
= S~'[C(xo +x) + (S — P)C(®)) 
for all x in G. Then an easy computation shows that E satisfies (mE) and by defini- 
tion 


5(E@) + E(-x))= 5S UCCo +x) +C(xo — x) + 2(S — P)C(x)] = CQ), 


which was to be proved. 


Result 2.16. [787]. Let G be an infinite topological Abelian group in which division 
by 2 is defined, n is a positive integer, and C : G + M,(C) is a continuous function 
for which the functional equation (C) holds for all x and y in G. If C(O) = In, 
then there exist continuous functions E : G — M,(C) satisfying (mE) and N : 
G — Mn-n,(C) satisfying (C) such that E(x) is triangular from above for all x in 
G, N(x) — I, is strictly triangular from above for all x in G, and further there is a 
regular matrix T in M,(C) for which the representation 


C(x) = T7'diag | N(x), 5(E@) + E-x)| a 


holds for all x in G. 
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For the following result only, we denote the elements of R” columnwise. Sup- 
pose f, g,h,k : R"” — R* satisfy the functional equation 


(2.22) flax + By)g(yx + dy) =A(x)k(y), forx,y eR", 


anda, /£, y, 6 are nonsingular square matrices of order n; that is, in GL(n, R). 
Note that f, g, h, k are never zero. 
Let F,G, H, K : R" > R*%* be defined by 


fix) a) _ kG) 
fo” °P-7q@ F@=7o 


and note that F(0) = G(O) = H(0O) = K (0) = 1 and (2.22) becomes 


g(x) _ k@) 


~ k(0)’ 


F(x)= for x € R", 


K(x) 


(2.23) F(ax + By)G(yx + dy) = H(x)K(y), forx,y € R". 


The following theorem is obvious. 
Theorem 2.17. Under the condition f, g,h,k #4 0, the solutions of the functional 
equation (2.22) are given by the functions 


f(x) =aF(x), g&)=bG(Q), haw)=cH(x), k(x) =dK(a), 


where F(x), G(x), H(x), K(x) are the solutions of the functional equation (2.23) 
under conditions F (0) = H(O) = G(O) = K (0) = 1 and (2.23), anda, b, c, dare 
arbitrary constants satisfying 


ab—cd=0, abcd #0. 
Proof. By putting x = 0 and y = 0 into (2.23) separately, we have 


(2.24) F(ax)G(yx) = H(x), 
(2.24a) F (By)G(6y) = K(y). 


Substituting (2.24) and (2.24a) in (2.23), we obtain 
(2.25) Flax + By)G(yx + dy) = F(ax)F (by) G(y x)GOy). 
For u,v € R, there are x, y € R” such that 
u= Ax and v= By, 
and (2.25) can be rewritten as 
(2.26) F(u+v)G(Pu + Sv) = F(u)F(v)G(Pu)G(Sv), 


where 
P=ya', S=op"'. 
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Case 1. Let P = S. Then (2.26) reduces to 
(E) E(u +o) = E\u)E(v), 


where 
E(u) = F(u)G(Pu), 


and from (2.24) and (2.24a) we obtain 
H(u) = E(au), K(u) = E(Bu). 

Thus, in this case we get 
(2975 h(x) =cE(ax), k(x) =dE(Bx), 


J, g arbitrary with 

f @)g(ya~'x) = abE(x), 
where EF satisfies (E), a = f(0), b= g(0). 
Case 2. Let P 4S. 


The proof involves the functional equation 


(4.22) p(x +y)+O@—y) =2(d)+ 60) +4(-y) 


studied in Chapter 4. 

If one of the inequalities p(T) < 1 or p(T~!) < 1(T = df7'ay—!), where 
p(T) is the spectral radius of 7, then the continuous solution of the equation (2.22) 
is given by 


f(x) =a exp(b}.x +x'Nx), g(x)= bexp(b5x +x'N>x), 


ae h(x) = cexp(bhx + x'N3x), k(x) = dexp(byx + x'Ngx), 
where 

(2.29) N3=a'Niaty'Noy, Na= fp'NiB+06'N20, 

with 


a NiB + y'N26 =0, b, = bia + boy, bi = bi B + b2d, c=h(0), d =k(0), 


by, bz, b3, ba € R". 
When P = —S and ay! + £6~! = 0, the continuous solutions of the equation 
(2.22) are given by 


f(x) =aexp(x'Nix), g(x) = bexp(x' Nox), 


(2.30) h 1 t 
(x) =cexp(x'N3x), k(x) = dexp(x' Na4x), 


where N3, Nq satisfy (2.28) and aN, B + y'N26 = 0. 
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Result 2.18. (Ecsedi [248]). The continuous solutions of (2.22) are given by either 
(2.27), (2.28), or (2.29). 


For the solution of the matrix equation 


D> fo@)Xgp(B) =C, 


p=l 


where a € M,(C), B € Mn(C), C € Minn(C), X € Minn(C), see Wimmer and 
Ziebur [832]. A special case of this is the well-known equation aX + Xf = C. 


3 


Trigonometric Functional Equations 


In this chapter, trigonometric functional equations are studied on reals, groups, 
Hilbert space, etc., and cosine equations on reals, non-Abelian groups, and Hilbert 
space are studied. Sine equations, analytic solutions, addition and subtraction formu- 
las, operator values, and some generalizations are studied also, and counterexamples 
are provided. 

One of the important applications of functional equations is a functional charac- 
terization of various functions such as Euler’s gamma function, Lebesgue’s singular 
function, cyclic functions, polynomials, exponential and logarithmic functions, etc. 
(the last three are treated in Chapters 7 and 1). The most extensively studied prob- 
lem of this type is the functional characterization of the trigonometric functions. 
Trigonometric functions—the sine and cosine functions—can be introduced in many 
ways: in terms of infinite series, as solutions of differential equations, by exponen- 
tial functions, or by geometry. In this chapter, the stress is going to be on functional 
equations. In this section, we will treat many functional equations connected with 
trigonometric functions, especially sine and cosine. We are all familiar with so many 
trigonometric formulas (identities). From the (addition) formulas 


(3.1) sin(x + y)=sinxcosy+sinycosx, x,yéER, 


one obtains 


(3.2) sin(x + y)+sin(x — y)=2sinxcosy, forx,y €R, 

(3.3) sin(x + y) — sin(x — y) = 2sinycosx, x,y ER, 

(3.4) sin(x + y)sin(x — y) = sin? x — sin’ y, x,y ER, 

and (3.4) leads to the study of the sine functional equation 

(S) g(x +y)ge—y)=8@)-g) forx,y ER. 

From the (addition) formulas 

(3.5) cos(x + y) =cosxcosy Fsinxsiny, forx,y €R, 
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one derives 


(3.6) cos(x — y)—cos(x + y) =2sinxsiny, x,yéER, 
(3.7) cos(x + y)+cos(x —y) =2cosxcosy, forx,y ER, 


and (3.7) leads to the study of the well-known functional equation known in the 
literature (Aczél and Dhombres [44], Aczél [12], Davison [211], Kannappan [426], 
Kurepa [577], and Stetkaer [772]) as the cosine equation, d’Alembert equation, or 
Poisson equation, 


(C) faty)+f@—y)=2f@)fO), forx,yeR. 


Functional equations arise from applications—the parallelogram law of forces 
or the rule for addition of vectors leading to the study of (C) and the problem of 
vibration of strings leading to the study of 


(VS) f@+y)—f@—-y)=e@)hQy), forx,y eR, 


were solved by d’ Alembert in three papers in 1747, 1750, and 1769 [189] (see among 
others Poisson [668, 669], Aczél [12], Aczél and Dhombres [44], Kannappan [426] 
and Kurepa [580]). The solution of (C) was obtained by d’Alembert by reducing 
(C) to a differential equation, a method quite common until the middle of the last 
century. One of the interesting aspects of functional equations is that, unlike in 
differential equations, one equation can determine many unknown functions. (VS) 
contains three unknown functions. 

We are going to look at these identities and others from the point of view of 
functional equations and obtain some interesting results. 

We will obtain general solutions of these and many more functional equations 
satisfied by trigonometric functions in this chapter. We will solve these equations 
without assuming any regularity conditions by sometimes reducing them to Cauchy 
and Pexider equations. We will also treat these equations on different structures 
for domain and range; that is, on abstract spaces. The literature in this area is 
voluminous—for a partial list, see [12, 40, 44, 88, 147, 162, 189, 211, 282, 348, 
422, 552, 426, 424, 577, 582, 579, 580, 477, 419, 717, 729, 772, 750, 795, 776, 767, 
830, 831]. 

Sometimes the established solutions of the unknown functions result in the usual 
trigonometric identity, but sometimes they do not. When the unknown functions are 
specialized as sine or cosine, the results obtained turn out to be the familiar trigono- 
metric identity. But the unknown functions need not be sine or cosine functions. 
We illustrate these in Section 3.1. 

First we will consider simple equations (mixed) containing trigonometric and 
general unknown functions and obtain some unexpected results. Then we will treat 
general trigonometric equations that do not contain trigonometric functions. 
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3.1 Mixed Trigonometric Equations 


We start off with the equation 
(3.8) faty)+f@—-y)=2cosxcosy forx,yE€R, 


where f : R > R (cf. (3.7)). 
y = 0in (3.8) yields f(x) = cos x (no surprise), giving the following result. 


Result 3.1. A function f : R > R satisfies (3.8) if and only if f (x) = cos x. 
Now we replace the cosine by the sine and consider 
(3.9) faty)+f@—-—y)=2sinxsiny forx,y ER, 


where f:R—-R. 
Equation (3.9) has no solution (not surprising); put y = 0 to get f(x) = 0, which 
is not a solution of (3.9). But if we consider for f,g: R— R 


(3.10) faty)+g@—y)=2sinxsiny, forx,y eR, 


we get a nontrivial solution. 
Equation (3.10) can be rewritten as 


f(x +y)+ g(x — y) = cos(x — y) — cos(x + y) 
or 


flu) +cosu =cosv — g(v) =c 


(setx +y =u, x —y =v). Thatis, f(u) =c—cosu, g(u) = cosu —c. Hence 
we get the following result. 


Result 3.2. The only solution f,g : R — R of (3.10) is given by f(x) = c¢ — 
cosx, g(x) =cosx —c (cf. (3.6)). 


Result 3.1a. Let f : R > R be a solution of the equation 
(3.9a) faty)—-f(—y)=2sinxsiny forx,yER. 
Then f (x) = —cosx +c, where c is a constant (cf. (3.6)). 
Proof. Let f satisfy (3.9a). Then 
Fx +y) — f@ — y) = cos(x — y) — cos(x +9); 


that is, 
f@ + y) + cos + y) — f(x — y) + cos(x — y) 
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(with x + y =u, x — y =v) shows that 


fu) =—cosu +c, 


which proves the result. 


Whereas for f,g: R—R 
(3.11) fi«ty)+f@—y)=2cosxsiny, forx,yeéER, 
has no solution, 
(3.12) faty)+g(«*—y)=2cosxsiny, forx,yER, 


has the solution f(x) = sinx +c, g(x) = —sinx —c (cf. (3.10), (3.3)). 

In (3.11), let y = 0 to have f(x) = 0, which cannot be. 

As for (3.12), write the right side of (3.12) as sin(x + y) — sin(x — y) (then use 
x+y =u, x — y =v) to get the solution sought. 


Result 3.2a. The only function f : R > R satisfying the functional equation 
(3.11a) fat+y)+ f@—y)=2sinxcosy, forx,y ER, 

is f(x) = sinx (cf (3.2)). 

Proof. Put y = x in (3.1 1a) to have 


f (2x) + f() = 2sinx cosx = sin2x, 


which gives the desired result (put x = 0 to get f (0) = 0). 


Now we will solve 
(3.13) f@ty)+g@—y)=2sinxcosy forx,y ER, 
where f, g: R > R (cf. (3.2)). 


Let y = 0 in (3.13) to obtain f(x) + g(x) = 2 sinx. Replace y by —y in (3.13) 
and subtract the resultant from (3.13) to have 


fa+y)+g@—y)—-f@—-y)-gat+y)=0 
or 
flu) — gt) = fe) — gv) =c¢ 


(setx+y =u, x —y =v). Hence f(x) = sinx + § and g(x) = sinx — 5. Thus 
we have proved the following result. 


Result 3.3. The only general solutions f, g : R > Rof (3.13) are f(x) = sinx + 
c, g(x) =sinx —c, where c is a constant. 
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Associativity is a useful tool. We have used it before in Chapter 1. We will 
highlight its usefulness in the next result and at many places in this chapter. 


Result 3.4. Let f : R > R be a solution of the equation 


(3.14) faty)+f@-—y)=2f@)cosy forx,yeR 
(cf. (3.6)) (see [12, 44, 144, 426, 580]). Then g : R — R defined by g(x) = 


f (x) — dcosx satisfies the functional equation 


(3.15) g(x+y)=g(x)cosy+g(y)cosx forx,yeR 


(cf. (3.5)). The general solution of (3.15) is given by g(x) = bsinx, and that of 
(3.14) is given by f (x) =dcosx +bsinx. 


Proof. Let x = 0 in (3.14) to have f(y) + f(—y) = 2d cos y. 
Interchange x and y in (3.14) and add the resultant to obtain 


2fa+y)+ fa-—y)+ fy —x) =2f@)cosy + 2f(y)cosx, 
which by the relation above becomes 
2f(x + y) + 2d cos(x — y) = 2 f(x) cosy +2f(y)cosx; 


that is, with g(x) = f(x) — dcosx, the equation above can be written as (3.15). 
Now we solve (3.15), using the associativity of (++) and (3.15), 


g(x +y +z) = (g(x) cos y + g(y) cosx) cosz + g(z) cos(x + y) 
= g(x) cos(y + z) + (g(y) cosz + g(z) cos y) cos x, 


which by equating the right sides yields 
g(x) siny sinz = g(z)sinysinx forx,y,z ER. 


Fix y,z with siny 4 0, sinz 4 0 to obtain g(x) = bsinx and then f(x) = 
dcos x + bsinx. This proves the result. 


Remark. Whereas g(x) = b sin x in (3.15) gives the addition formula for sine (3.1) 
and the solution also is no surprise, f(x) = d cos x in (3.14) gives the familiar addi- 
tion formula for cosine (3.5) but f(x) need not be cosine. Further, the same conclu- 
sion and result hold even for f : R > C, satisfying (3.14). Note that the solutions 
of (3.14) and (3.15) are obtained without assuming any regularity condition. 


Similar to equation (3.14), we consider (3.16) in Theorem 3.5. 


Theorem 3.5. For f : R — R, there isa g : R > R satisfying the functional 
equation 


(3.16) fe —y)— fe +y)=2g)siny, forx,y ER, 


if and only if f(x) = acosx —dsinx +c, g(x) =asinx +dcosx. 
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Proof. Set x = 0 in (3.16) to get f(—y) — f(y) = 2dsiny. 
Interchange x and y in (3.16) and subtract the resultant from (3.16) to have 


f@ —y)— fy —x) = 2g(x) siny — 2g(y) sinx; 


that is, 
—2d sin(x — y) = 2g(x) siny — 2g(y) sinx 
or 
(g(x) — dcosx) siny = (g(y) — dcosy) sinx, 
so that 


g(x) =dcosx +asinx 
(fix y with sin y 4 0). Substitution of this g(x) in (3.16) results in 
f(x-—y)-— f+ y) =2(dcosx + asinx) sin y 
= d(sin(x + y) — sin — y)) + a(cos(x — y) — cos(x + y)) 


or 
fu) —acosu+dsinu =c 


(setx + y =u, x — y =v). This proves the result. 


Remark. f(x) =cosx, g(x) = sin x results in the trigonometric identity (3.6), but 
f need not be cosine and g need not be sine. 


Now we will consider a few equations related to sine before treating general func- 
tional equations connected with trigonometric functions and on different structures. 
First we consider the three simple equations 


(3.17) sin(x + y) = f(x)siny+ f(y)sinx, forx,yeR 
(cf. (3.1), [118]), a slight generalization of (3.17), 

(3.18) sin(x + y) = f(x)siny+ g(y)sinx, forx,y é€R, 
and a generalization of both (3.17) and (3.18), 

(3.19) f@+y) =g()siny+ g(y)sinx, forx,yeER. 


As for (3.17), set y = 4 to have cosx = f(x) + f (¥) sinx, which with x = 4 
yields f (3) = 0 and f(x) = cos x (no surprise). 

As for (3.18), as before, let y = % to get cosx = f(x) + g (4) sinx or 
f(x) = cosx + bsinx, and substitution of this f(x) in (3.18) shows sinx cos y = 
bsin x sin y+g(y) sinx; thatis, g(y) = cos y—b sin y (choose x such that sinx 4 0). 
Both equations can also be solved by expanding sin(x + y). 

Finally, put y = 0 in (3.19) to obtain f(x) = g(0) sinx = asinx. Then y = 4 
in (3.19) yields 
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1 T 
x+ 5) = (=) sinx + xe 
f(x+S)=8(5 ex) 
that is, 
1 
acosx = g (5) sinx + g(x). 


us 


Setting x = 5, we see that g (4) = 0 and g(x) = acosx (no surprise in (3.19)). 
Thus we have proved the following result. 

Result 3.6. A function f : R > R satisfying (3.17) is given by f(x) = cosx; 
general solutions f, g : R — R of the equation (3.18) are given by f(x) = cosx + 
bsinx, g(x) = cosx — bsinx; and functions f, g : R — R satisfy (3.19) if and 
only if f(x) = acosx = g(x). 

Remark. There are no surprises in (3.17) and (3.19). Whereas f(x) = cosx and 


g(x) = cosx yield the familiar formula for the addition of sine, f and g need not be 
cosine. 


Next we treat the following equations and solve the second equation using asso- 
ciativity, which comes in handy in many places: 
(3.20) fixty)f@—y) =sin?x —sin? y, forx,y ER, 
(3.21) fet yf —y) = f(x)’ —sin’y, forx,y eR 


(replacing f on the right side of (3.21) with g makes no difference). 
Now (3.20) can be rewritten as 


f(x + y) f(x — y) = sin + y) sin — y) 


or 
f@)fo) =sinusnvo (x«+y=u,x-—y=vd). 

Noting that f = 0 is not a solution of (3.20), we obtain f(u) = k sinu (fix v). This 

f is a solution of (3.20) provided k? = 1. Thus we have proved the following result. 


Result 3.7. The only solution of (3.20) for f : R ~ Rhas the form f (x) = k sinx 
with k? = 1 (no surprise—cf. (3.4)). 


Theorem 3.7a. The most general solution f : R — R satisfying the functional 
equation (3.21) has the form f (x) = asinx or f(x) =bcosx +d sinx, anda,b,d 
are real constants with a2 = 1 = b* + d?. 


Proof. Set y = x in (3.21) to get 
(3.21a) fQx)f (0) = f(x)? —sin’x forx ER. 


We distinguish two cases according to whether f(0) = 0 or f(O) # 0. First 
suppose f (0) = 0. Equation (3.21a) yields fay = sin’ x, (3.21) becomes (3.20), 
and by Result 3.7 we obtain f(x) = asinx with a* = 1. Now assume f(0) # 0. 
Let f (0) = b. From (3.21) and (3.21a), we have 


112 3 Trigonometric Functional Equations 


fix +y)f(« — y) = fx) f 0) + sin? x — sin? y 
= bf (2x) + sin (x + y) sin (x — y) 


or 
flu) f(v) = bf (u+v) + sinu sino, 
which is 
(3.21b) bf(u+v) = flu)f(v) —sinusinv foru,v ER. 


Applying associativity, (3.21b) gives 
bftu+v+w) = flut+ov)f(w) —sin(ut+ v)sinw 


= (Flu) f(0)—sin sin v) f (w)—(sinu cos v+cos u sinv) sin w 


= fF w)(F (0) fw) —sin v sin w)—sinu(sin v cos w+cos v sin w); 


that is, 
1 1 
5 if (w) sinu + cos u sin w = pf wsinw + sinu cos w 


or 
(Fre) — cos v) sinu — (Fre — cos “) sin w. 


If f(u) = bcosu for all wu € R, we are done or divide by sinw (choose w with 
sinw # 0). Then we obtain 


f(t) = b(cosu+dsinu) = bcosu+dsinu for constants d, b. 


Substitution of this f in (3.21) gives d? + b? = 1. This proves the result. 


Remark 3.8. Most of the functional equations studied in Results 3.1 to 3.7 come 
under the general form 


A(f(xt+y),fa—y), f@), fQ), x,y) = 0 


(Aczél [12], Castillo and Ruiz-Cobo [144]), where H : R° > R is a known function 
and f : R > R is the unknown function to be determined. Note that the equations 
above are solved without assuming any additional conditions such as continuity, dif- 
ferentiability, etc., except that the functions satisfy the functional equations. Note 
also that the unknown functions are continuous (even differentiable). 


Now we turn our attention to general functional equations derived from several 
trigonometric identities. This domain is very vast. We consider a substantial part of 
it (see also Aczél [12], Aczél and Dhombres [44], Kannappan [426, 477]). 

Usually these functional equations have been studied from R — R, R > 
C, C — C, except by Kurepa and a few others, who studied different structures. 
We study these equations on reals, complex numbers, groups, Hilbert space, Banach 
algebra, etc. We are going to consider functional equations characterizing both 
cosine and sine separately and together and in conjunction with other functions. First 
we treat the d’ Alembert cosine equation. 
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3.2 Cosine Equation on Number Systems 


The cosine function satisfies the functional equation (C), so (C) is known as the 
cosine equation. The functional equation (C) has been extensively studied by many 
authors under various hypotheses. Naturally one expects the literature to be large. 
We have provided a good partial list. The proof of the problem of the parallelogram 
law of forces was reduced to the solution of (C) by D’Alembert [189], as seen in 
Chapter 1. Of course, the problem of the parallelogram law of forces is one of the 
oldest problems studied by means of functional equations. This equation was consid- 
ered for the same purpose by Poisson [668, 669] with the hypothesis of analyticity. 
In the literature, (C) is known as d’Alembert’s equation and Poisson’s equation. The 
equation (C) finds applications in non-Euclidean mechanics and geometry (Aczél 
[12]). The functional equation (C) is closely connected with Cauchy’s exponential 
equation (E), as we will see later. 

Functional equations are solved by various methods such as reducing to differ- 
ential equations, using integral transformations, reducing, to integral equations, the 
method of determinants, and so on. We illustrate these methods with examples. 

Usually the study of (C) has been done on R > R, R > C, C > C. We will 
treat (C) on other spaces, too. We start off with the study of (C) on reals. 


Theorem 3.9. (Aczél [12], Aczél and Dhombres [44], Cauchy [147], Castillo and 
Ruiz-Cobo [144], Kannappan [426], Kurepa [580]). When f is a real-valued contin- 
uous function of a real variable satisfying (C) (that is, f : R — Ris a continuous 
solution of the cosine equation (C)), then f has one of the forms 


(3.22) f(x)=0 or cosbx, forx ER, 


where b € R or purely imaginary; that is, f (x) = cos bx or cosh bx for b real. 


Proof. Several solutions are available in the literature. We give here two proofs— 
one based on differentiability and differential equations and the other involving the 
integral transformation. Later we will see in Section 3.3 that there is no need to 
assume any regularity condition to solve (C). But to get a form like (3.22), it is 
necessary to assume some regularity condition. 

f() = d, a constant, is a solution of (C) when d = 0 or 1. So, hereafter 
we consider nonconstant solutions (that is, solutions not of the form f(x) = 0, 
Ff (x) = 1). 

First we start with some preliminaries that will be of use in what follows. 

Change y to —y in (C) or interchange x and y in (C) to have f(x) = f(—x) for 
all x € R; that is, f is even. 

Set x = 0 (or y = 0) in (C) to get f (0) = 1. 

With y = x, (C) becomes 


(3.23) fx)+1=2F()2, forx eR, 
or 
(3.23a) f (=) = i, forx ER. 
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From (C) and (3.23), we obtain 
2fle+yfe-y=fe+y+x—y)+ f@+y—-@—y)) 
= f(2x)+ fy) 
(3.24) = 2(f(x)>+ f(y) -1) forx,y ER. 
Hence, by (C) and (3.24) there results 


[fa@t+y)-f@-yP = @+y)t+f@—y)* —4f@+y)f@-y) 
=4f (xP fo) —4(f x)? + FO - D 
(3.25) = 4[ f («)* - 1ILf(@)* - 1 


or 


(3.25a) [f@t+y)—f@) FO)? = 41f@) - 11LfO)*- 1] forx,y eR. 


From continuity follows the differentiability of f, from which follows that f has 
derivatives of all orders. We require here derivatives of order two only. Differentiat- 
ing (C) with respect to y twice, we have 


fi@ty)-f'@-—y) =2f@)F'6O) 
and 
fF @+y)+ f"@—y) =2f@)F"O), forx,y eR, 
which, with y = 0, yields 
f'() = 0, 
and 
f(x) =kf (x), forx ¢R, k constant. 


Solution of this differential equation depends on k. 

k = Oresults in f” (x) = 0 or f(x) = ax +b. This f is a solution of (C) when 
a=Oandb=Oorl. 

When k > 0, there is a b in R such that k = b? and 


f(x) =csinbx +dcosbx, c,d constants. 


Conditions f(0) = 1, f’(0) = 0 imply d = 1 and bec = 0. b = 0 gives f constant, 
which is not the case. So, c = O and f(x) = cosbx, b € R, which is (3.22). Finally, 
k < OQ leads to 

f(x) =csinhbx + dcoshbx, 


where b is real and b* = —k. Conditions f(0) = 1, f’(0) = 0 lead to the solution 
f (x) =coshbx, b € R, which is (3.22). 


Now we will use integral transformation to obtain the same conclusion (3.22). 
Carstoiu (1947) remarked that integral transformation can be used to reduce func- 
tional equations. We explain his method by using the example of the d’ Alembert 
equation (C). If é(p) denotes the transform of f (x), it follows from (C) that 
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q¢(p) — pd) i qp(p) + po(q) 


aae sae $(p)o(q) 


that is, 


$(p)q°10(q) — 11 = (@)p"I¢(p) — 1, 


2 -=)- °( -s)=« 
al dp) 4 : $(q) , 
2 


o(p) = or (p) =0. 


and so 


pea=e 


Hence f(x) = 1 or cos./—cx or cos ./cx, according to whether c = 0, < 0, or > 0 
or f (x) = 0, which is (3.22) (see [12]). 


Remark 3.10. It is easy to see from Theorem 3.9 that the only continuous solution 
f :R— C of the equation (C) is (3.22), where b € C. 


This result (3.22) has been extended by Kaczmarz [409] to the case in which 
f is a complex-valued, measurable function of a real variable (weaker hypothesis, 
measurability implies continuity), and he studied the functional equation 


(3.26) fa)+fe+y)=0O)F (+5) forx,yeR. 
Now we consider the functional equation 


(3.27) faty)+ fa—y)=2f@)e(y) forx,y ER, 
which is the same as (3.26) (Kannappan [439], Van der Lyn [810], Wilson [830, 831], 
Kurepa [580]), where f, 9 : R — C, and prove the following theorem. 
Theorem 3.11. Suppose f,g9 : R > C satisfy (3.27) with g continuous. Then @ is 
a solution of (C), and the solutions of (3.27) are given by 

f =0, g arbitrary, p(x) =1, f(x) = A(x) +d, 

f(x) =csinbx +dcosbx, o(x) =cosbx, 
where A is additive and b,c, d are complex constants. 


Remark. That equation (3.26) is the same as (3.27) can be seen by substituting in 


(3.26) x — y for x and 2y for y to get f(x —y) + fa + y) = gy) ff). 
So we will solve (3.27) with g continuous. We will determine the general solu- 
tion of (3.27) in Theorem 3.46. 


Proof. When f = 0, ¢ is arbitrary. When g = 0, so does f. 
We consider when f 4 0. Choose x9 € R such that f(xo) 4 O and (3.27) 
becomes 


(3.28) 2f(xo)e(y) = f@o+y)+fo-—y) fory eR. 
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Claim that g is a solution of (C). For x, y € R, utilizing (3.28) and (3.27), we have 


2f xo)le + y) +o — y= fo+x+y)+ f@o—x — y) 
+ fo +x —y)+ f(xo-—x+y) 
= 2f(%o + x)o(y) + 2f Go — x)e(y) 
= 20(y) -2fo)e(); 
that is, @ satisfies (C). So far, no regularity condition has been used. 
Suppose g is continuous. Then, by Remark 3.10, 9 = 1 or g(x) = cosbx, 
beC. 
g = 1 in (3.27) yields f(x + y) + f(@ — y) = 2f(), which is Jensen’s equa- 
tion (J) and f(x) = A(x) +d, for x € R, where A is additive andd € C. 
With g(x) = cos bx, (3.27) becomes 


fa@t+y)+ f@—y) =2f (x) cos by, 
the equation studied in Result 3.4. Hence 


f(x) =csinbx+dcosbx forb,c,d €C. 


This proves the theorem. 


Remark 3.12. The result of Kaczmarz was extended by Flett [280]. If f : C > Cis 
a nonzero continuous solution of (C), then it is no longer true that the solution of (C) 
is of the form (3.22); that is, f(z) = coshbz, b is a complex constant, and z € C. 
What is true is the following. Let f (4 0 or 1) be a complex-valued function of a 
complex variable continuous at least at one point and satisfying (C). Then 


f(z) =cosh(ax + By), z=xtiy, a,f eC, 


not both zero. If f is differentiable at at least one point that is not a zero of sinh(ax + 
fy), then 6 = ia and f indeed has the form (3.22), f(z) = cosaz, a € C. 


This result was further extended to R” in [572]. Let f : R” — C, continuous at 
some point, be a solution of (C). If f 4 0, then 


n 
Sf (41, %2,---5Xn) = core) a; €C, x7 ER. 


i=1 


We now consider a generalization of (3.27), namely 


(3.29) faty+fa-y)=g(x)hQy), forx,yeR. 


We will determine the general solution of a generalization of (3.29) later, in 
Section 3.8. 

If g = Oorh = 0, then f = 0. If f = 0, then either g = 0, A arbitrary or 
h = 0, g arbitrary. So we consider only nonzero solutions of (3.29) and prove the 
following theorem. 
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Theorem 3.13. Nonzero, continuous f, g,h : R > C satisfy (3.29) if and only if 


1 
(3.30) h(x) =a, ga) =bx+c, f(x)= zalbx +c) 
or 
h(x) =acosbx, g(x) =dcosbx + csinbx, 


(3.30a) 1 
f@)= ald cosbx +csinbx), 


where a,b,c, d are complex constants. 
Proof. Set y = 0 in (3.29) to get 
(3.31) 2f(x)=h(O)g@) =ag(x) forx ER. 


h(O) = a = Oimplies f = 0, which is not the case, so a € 0. Using (3.31), (3.29) 
becomes 
h(y) 


> 
a 


g(x t+y)+ g(x —y) = 2g(%)- 


which is the same as (3.27). Thus A satisfies (C) and, by applying Theorem 3.11, we 
obtain (3.30) and (3.30a). 


If one is interested in obtaining only the cosine solution (not the hyperbolic 
cosine), one can consider the functional equation 


(3.32) f@ty+2d)+ fa-—yt+2d)=2f@)fi) forx,y eR, 


which is a modified form of (C). 
Theorem 3.14. (Kannappan [420]). Let f : R > C be anonczero solution of (3.32), 
where d #0. Then f is periodic with period (i) d, (ii) 2d, or (iti) 4d, depending on 
f() = 1 With f(d) = lor f(d) = —1) or f() = —1 (with f(d) = 0), and f 
satisfies (C) in cases (i) and (ii) and g(x) = f (x + 2d) is a solution of (C) in case 
(iii). 
Proof. We will consider any nonzero solution f of (3.32). 

Let y = 0 in (3.32) to get 


(3.33) f(x +2d) = f(x) f) forx ER. 


f (0) = 0 would imply f = 0, which is not the case, so f (0) 4 0. Equation (3.33) 
gives f(x +4d) = f(x + 2d) f (0) = fx) f O)’. 

Now, replacing y by y + 2d in (3.32) (using (3.33)), 
(3.34) fe +y)fO? + FO —y) =2F@FO)FO). 


Putting x = 0 = y in (3.34) shows that f (0)? = 1, so that f(0) = 1 or f() = —1. 
Case 1. Suppose f (0) = 1. 
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Equation (3.33) shows that f(x + 2d) = f(x) and then (3.32) (or (3.34)) 
becomes (C). 

Now f(2d) = f(0) = 1 and f (2d) = 2 f(d)* — 1 (see (3.23)) yields f(d) = 1 
or —1. 

Suppose f(d) = 1. Then (C) with y = d and x replaced by x + d gives that 
(i) f(x +d) = f(x); that is, f has period d. 

When f (d) = —1, (C) with y = d and x replaced by x +d yields (ii) f(x) = 
—f(x+d)= f(x + 2d); that is, f has period 2d. 

Case 2. Suppose f (0) = —1. 

Then (3.33) shows that (iii) f(x) = —f(x + 2d) = f(x + 4d); that is, f has 
period 4d. Set x = 0 in (3.33) to have f(2d) = 1, and f (4d) = —1 and let x = —d 
to get f(d) = —f(d). Nowx =d, y = —d in (3.32) to have f(2d) + f(4d) = 
2 f (d) f (—d); that is, f(d) = 0. 

The periodicity in (i), (ii), and (iii) eliminates a hyperbolic solution. 

In Case 2, define g(x) = f(x + 2d), for x € R, so that (3.33) gives g(x) = 
f(x) f 0) = —f (x) and (3.32) gives 


g(x ty) +e —y) =2f@)fO) = 2g@)g(), 


which is (C). Note that g also has period 4d. This proves the theorem. 


Corollary 3.14a. Nonzero continuous solutions of (3.32) are given by 


2nax (2n + 1)ax (2n + 1)ax 
f(x) OS or cos ————— _ or — — cos —————— 


d 2d 
forx ER, neZ. 


Proof. Use Theorem 3.9. In case (i), f is a solution of (C) with period d so that 
f (x) = coscx for some constant c, with f(d) = coscd = 1; that is, cd = 2nz. 
Hence f(x) = cos ace 

In case (ii), f again is a solution of (C) with period 2d, so that f(x) = coscx 
for some constant c, with f(d) = coscd = —1; that is, cd = (2n + 1)a and 
f(x) =cos Cot Ex 

In case (ili), f (x) = g(x — 2d) = cos c(x — 2d) for some constant c with period 
4d and f(d) = coscd = 0 or cd = (2n + 1)5 and 


f(x) = COs Curae e — 2d) = cos (ou — (2n + yr) 
2d 2 
(Qn + 1)xx 
= — cos —————_ 
2d 


Now we determine the general solution of (3.32) using Theorem 3.21 and 
Theorem 3.14. 
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Remark 3.14b. The general solutions f (nonconstant) of (3.32) are given by 


f= TN Ee 

where E satisfies (E), with (i) f(d) = 1 = f(0) = Ed), f period d; (ii) f(d) = 
—1=-—f(0) = E(@), f period 2d; (iii) F@) = 0, FO) = —1 = E(@)’, f period 
4d. The proof follows easily from Theorem 3.21. 


Before considering (C) on groups and abstract spaces, we treat the functional 
equations 


(3.35) fatyfa-y)=f@)y+ for -b*, forx,y eC, 
(3.35a) fatyfe-y=fey-foy, forx,y €C, 


arising from the identities 


cos(x + y)cos(x — y) = cos’ x + cos’ y — 1, 


2 


cos(x + y)cos(x — y) =cos’x —sin? y, forx,y €C. 


Theorem 3.15. Suppose f : C — C satisfies (3.35) for some constant b. Then 
f(x) =0 when b = Oor g(x) = +4 f (x) is a solution of (C) when b 4 0. Further, 
if f is anonzero analytic function satisfying (C), then f (z) = bcos dz, forz € C, 
where d € C. 


Proof. Let b = 0. Then y = 0 in (3.35) shows that f(0) = 0. Then y = x in (3.35) 
shows that f(x) = 0. 

Now let b ¥ 0. Set y = 0 in (3.35) to get f (0)? = b?. First y = x in (3.35) gives 
+bf (2x) = 2f(x)* — b? and then 


— 


fx +y)f(@ — y) = <[b* + bf (2x) +b? + bf 2y)] — b? 


N 


b 
= +50) +7 Gy) 


or 


2 
telfwf@)l = fut+o)+ fu—), 


whereu =x+y,v=x-—y. 
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Define g(x) = +4 f (x) so that g satisfies (C). When f (# 0) is analytic, from 
Remark 3.12 it follows that f(z) = acoshaz, a,a,z €C. 


Theorem 3.15a. Let f : C — C be a nonconstant analytic function that satisfies 
(3.35a). Then f is a solution of (3.35). 


Proof. Setting y = 0 in (3.35a), we see that f’(0) = 0. Now let y = x in (3.35a) 
and then differentiate with respect to x to have 


F'Ax)fO) = FOS) — fF") forx eC. 


Differentiating (3.35a) with respect to x and letting y = x in the resultant equation, 
we have 


f' (2x) f 0) = 2f @) fF") 
so that 


FOO) =—-f'@)F"™),  forx eC, 


which by integration yields 


f(x)? = —f'(x)? +b, 


where b is a constant. Hence (3.35a) becomes (3.35). 


3.3 (C) on Groups and Vector Spaces 


Certainly the functional equation (C) has a meaning on any group G. We take G to 
be a multiplicative group. Then (C) takes the form 


(CE) f(xy) + fy!) =2f@)fQ) forx,y €G. 


One way to study (CE) on a purely algebraic structure is through the homomorphism 
g:Go>C 


(3.36) gry) = g(x)g(y) forx,y €G. 


For example, if G is a group, then the function defined by 
1 ok 
(3.37) Fe) = sia@)+a"@)), forx eG, 


is a solution of (CE), where g*(x) = g(x)7!. 

Naturally, one would like to find out whether or not every solution of (CE) on 
an arbitrary group has the form (3.37). When G is a topological group and f is a 
continuous solution of (CE), is g also continuous? If this g exists, is it unique? Given 
g, f is determined by (3.37). Given f, is it possible to determine g through f? The 
answers to these questions are provided in this section and Section 3.4. 
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The answer to the form (3.37) is affirmative for Abelian groups (Kannappan [424, 
477, 422]); that is, every nonzero solution of (CE) has the form (3.37). As for non- 
Abelian groups, if f satisfies the additional condition 


(K) f(xyz) = f(xzy), forallx,y,z€G, 


then indeed every solution of (CE) is of the form (3.37) (see Section 3.4). On some 
special groups, f is of the form (3.37) without the use of the additional condition 
(K). We will also see examples of solutions of (CE) on a non-Abelian group not of 
the form (3.37). 


Lemma 3.16. [424, 477]. Let G be any group and g\, g2 : G > C* be homomor- 
phisms (3.36) such that 


81+ 8] = 824 8. 
Then g2 = 81 Or g2 = 8}. 
Proof. Suppose gi (x) + gf (x) = g2(x) + g3(x) for x € G. Then 


1 
gi(x) — g5(x) = g2(x) — FE 
that is, 
8i)g2(x) ~1 _ gir)g2Q%) = 1 
g2(x) ei(x) 
or 
(ei(x) «-D/ ae )=0 
eee g2(x) gi (x) 
or 


[g1(x)go(x) — 1]lgi@%)g3(x) — 1] = 0. 


Let G; = {x € G: gi(x)go(x) = 1} and G2 = {x € G: gi(x)g3(x) = 1}. Then 
G and G» being the kernels of the homomorphisms, g}g2 and g1g3 are (normal) 
subgroups, and G = G; U G2. But a group is never the union of two subgroups, for 
ifx; € G1, x2 ¢ Go, then one of x1, x2, x1x2 is not in Gj UG». So either Gj = G, 
in which case g2 = gj, or G2 = G, in which case g2 = g1. This proves the lemma. 


Lemma 3.16a. Let G be any group. Let f : G > C satisfy (CE), and assume the 
values +1 only. Then f has the form (3.37). 


Proof. Set y = e in (CE) to have f(e) = 1. Then x = e in (CE) yields f(y) = 
7), 
Since f(x) = +1 forall x € G, 
flay) + f(xy!) = £2 (from (CE)) 


and 


_ 


Fonies J ==. 
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If f(xy) f (xy—!) = —1, then either f(xy) = 1 and f(xy~!) = —1 or f(xy) = —1 
and f(xy!) = 1. In this case, f(xy) + f(xy~!) = 0, contradicting f (xy) + 
f (xy7!) = +2. Hence f (xy) f (xy~!) = 1 and 

Lf@y) — f@y)P = (fy) + F@y DP - 4f &y) fey) = 0; 
that is, fry) = f (xy!) and f(xy) = f(x) f(y) (from (CE)). Then f(x) = 
LOL) ‘and has the form (3.37). 


Lemma 3.16b. [477]. Let G be any cyclic group (finite or infinite). Then every 
nonzero solution f : G — C of (CE) is of the form (3.37). 


Lemma 3.16c. [477]. Let G, and G2 be two Abelian groups and every solution of 
(CE) on G; (i = 1,2) have the form (3.37). Then the same is true of all the solutions 
of (CE) on the direct product G, x G2. 


Lemma 3.16d. [477]. Let Gj (i € I, an index set) be a family of Abelian groups 
such that every solution of (CE) on G; has the form (3.37). Then the same is true of 
all solutions of (CE) on the weak direct product P*G; (x € P*Gj means x = (xi) 
with x; = e; for all but a finite number of indices). 


Lemma 3.16e. [477]. Let G be any group and H be a homomorphic image of G. 
If every solution of (CE) on G is of the form (3.37), then every solution of (CE) on 
His also of the form (3.37). 


Piecing together the results above, we can prove the following theorem. 


Theorem 3.17. (Kannappan [477]). Let G be any Abelian group. Then every 
nonzero solution of (CE) on G has the form (3.37). 


Proof. It is well known that every Abelian group G with m generators is a homo- 
morphic image of the weak direct product P*Z’, Z being an infinite cyclic group 
of integers. But every solution of (CE) on P*Z” is of the form (3.37) by Lemmas 
3.16b and 3.16d. Now an application of Lemma 3.16e shows that all solutions of 
(CE) on G are of the form (3.37). Since there is no restriction whatsoever on the 
cardinality m for the validity of Lemma 3.16d, we conclude that every function that 
satisfies (CE) on an arbitrary Abelian group G has the form (3.37). 


Now we prove that if f of (CE) is continuous, so is the corresponding g in (3.37). 


Theorem 3.18. [424, 477, 422]. Let G be a topological group (Abelian or not) 
and let f be a complex-valued, continuous function defined on G. Further, let g 
be a homomorphism of G into C* such that f has the form (3.37). Then g is also 
continuous. 


Proof. By hypothesis, 


* 
f(xy) = = for all x,y € G, 


_ 8)a(y) + g*(x)g*(y) 
y) > 
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and 
one 
1 
= gle@)e(v) + 8@)e"0) + 8*@)e0) + 8*@)8*O)L 
and so 
je Fejyoy st OED 
(3.38) = £@)= 2) 20) °0) 
But 
poy = SEO) 
and hence 
aa HQ =o 


2 
From (3.38) and (3.39), we obtain 
(3.40) fay) — f@)FY) = [¢@) — F@)lgO) -— FO). 


Case I. Suppose that g(x) = f(x) for every x in G. Then, obviously, g is con- 
tinuous. 


Case II. Otherwise, there is an x9 in G such that 6 = g(xo) — f (xo) 4 0. With 
xo for y in (3.40), we get 


Gal) gC) = FO) + Ff C0) ~ FOF O0)) 


From (3.41), it follows that, if f is continuous, then so is g. This completes the proof 
of the theorem. 


Remark. Note that g is given in terms of f, which will be used later. 


Remark 3.19. Let G be a topological group (Abelian). Let f be a nonzero contin- 
uous solution of (CE) on G. Then f is of the form (3.37), where g is a continuous 
homomorphism of G into C*. 

Suppose G above is locally compact and f is (Haar) measurable. Then it is easy 
to see that g is also measurable [369]. But since g is a measurable homomorphism, 
it is also continuous. Hence f is continuous. So, every measurable solution of (CE) 
is continuous. 
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(C) on a Vector Space 


Result 3.20. [427]. Let V be a real vector space. Suppose a nonzero f : V > 
C satisfying (C) is continuous along rays. Then there exists a linear function 
L:V —C such that f(x) = coshL(x). If V is a real linear topological space, 
then the corresponding L is also continuous. If f : V — R is continuous along 
rays, f #0, then every solution of (C) is of the form 


f(x) =cosL(x) or coshL(x). 


3.4 Solution of (CE) on a Non-Abelian Group 


Let f : G — C be a nonzero solution of (CE) satisfying (K), where G is a non- 
Abelian group. 

First we develop some preliminaries that will be useful in what follows [424]. 
First put y = e (identity of G) in (CE) to have f(e) = 1 (use f #0). Then x = e in 
(CE) yields 

fo )=f0) fory €G; 
that is, f is even. 

It is easy to see that (interchanging x and y in (CE)) f(xy) = f(yx) for all 
x,yeG. 

Letting y = x in (CE), we have 


(3.42) f@%)41=2fGY forx eG 
(cf. (3.23), known as the duplication formula). Using (CE) and (K), we obtain 
2f (xy) fy!) = fry -xy7!) + fey: yx7') 
= f(xy: y 'x)+ f(ex!-y’) 
(3.43) = f(x*)+ f(y), forx,y eG. 
From (CE), (3.42), and (3.43), we find that 
[f@y) — fay DP =[f@y) + fay DP -—4f Oy) fay!) 
=4f(«) fO) — 247) + FO’) 
=4f (x) f(y) —4f x) —4f 0) +4 
(3.44) = 4[ f(x)’ - IFO) - 1) 
or 
[2f xy) —2f@ FOP = 41F@) — HLFO) - U1; 
that is, 
(3.444) [Ff @xy) -— f@) FO)? =[f@) - UFO) - 1] forx,y eG 
(cf. (3.25) and (3.25a)). 
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Now we are ready to prove the main theorem. 


Theorem 3.21. (Kannappan [424]). Let G be a non-Abelian group and f :G—> C 
be a nonzero solution of (CE) with (K). Then f has the form (3.37). 


Proof. Lemma 3.16a shows that if f(G) C {—1, 1}, then f has the form (3.37). 
Suppose there exists an x9 € G such that f(xo)* # 1. Leta = f(xo) and f bea 
square root of (a2 — 1); that is, a =o*—1. 

Now define g : G > C by (cf. (3.41)) 


1 
g(x) = fx) + pit Gor) — fof), x€G, 


(3.45) = lf Gor) + (B—a)f(x)I. 


We will prove that g is a homomorphism; that is, g satisfies (3.36) and f has the 
form (3.37). First we will show that f is of the form (3.37). From (3.44a) and (3.45), 
it follows that 


1 
[g(x) — f(@®)P = gilt Coy” -1f@) - 1] 
=f) -1 


or 
g(x)? —2f(x)g(x)+1=0. 


Hence, g(x) 4 0 and moreover 


_ g(x) +8*@) 
=. 


forx € G. 


f(x) 


This proves (3.37). It remains to show that g is a homomorphism. 
By the repeated use of (CE) and (K), we have 


2LF ox) f(y) + f Hoy) fe) = Foxy) + f oxy!) + f Ccoyx) + f Groyx7!) 
= 2f (xoxy) +2f (x0) f xy) 
= 2 f @oxy) + al2f@) f(y) — fy) 


and 


2f (xox) f (coy) = f xgxy) + fay) 
= 2f (xoxy) f (0) — f(xy) + 2F@ FO) — Fry) 
= 2laf (xoxy) — fey) + FOFO)I. 
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In view of these, finally we get 


1 
g(x)g(y) = Fil Gor) + (B — a) f ®ILF @oy) + B- a) f OI 
1 
= palf Cox) f Boy) + (B — a) f (xox) f ”) + Foy) f)} 
+ (Ba) fx) fO)I 
1 
= pillaf Gory) — fey) + f@FO)} 


+ (P= a) (eoty) + 2af (2) FO) — af ey) 
+ (B° — 208 + 0°) f(x) fQ)I 

- FIBL soy) + ap — 2a? +14 6? 20 +0) F(x) f() 
+ (a? — of —1)f@y)l 

7 git Gory) + (Ba) f(xy)] 

= g(xy). 


This proves the theorem. 


The condition (K) in a way reduces the study of (CE) to the Abelian case 
[429, 588]; that is, the condition (K) reduces the study of (CE) to the study of the 
corresponding equation over the commutative group &. where G’ is the commutator 
group of G. 


Remark 3.22. [429, 588]. Let f : G > C satisfy (CE) and (K). Use (K) to get 
(3.46) f(x) = f(xx7!y7!xy) = f(xu) forx €G, 


where uw is any element of the commutator subgroup G’ of G. Note that f(u) = 
f (zyz~ly7!) = 1 by (K). 
Now define F : & — Cby 


(3.47) F(®) = f(x) forxinx¥e <. 


The function F is well defined because of (3.46), for, if y € x, then xy € G’. 
Then, by (3.46), 


fe) = fay -y) = FO) 
since xy~! € G’. (Note that f = Fo, where £: G > & is the canonical 


homomorphism.) 
It is easy to verify that F is a solution of (CE) on &. Then, by Theorem 3.17, 


there is ahomomorphism k : & — C* such that 
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K(x) +k (X G 
PaO) forx € —. 

2 G' 
Now, if we define g : G > C* by g(x) = k(X), where x is in ¥ € &, g is well 
defined and a homomorphism, and f has the form (3.37). 


As a matter of fact, the following is true. 


Result 3.17a. (Kannappan [429], Lawrence [601]). Let f : G > C be a solution of 
(CE) and f #0. Then, that f has the form (3.37) implies f (G') = 1, and if G' is a 
torsion group and f (u) = | for every u € G’, then f is of the standard form (3.37). 


Proof. Suppose f has the form (3.37). Then 


—1,,-1 * —1,,-1 
F(ayxoly-}y = SE y “8 (xyx~"y~") 


_ s@)gQ)g@7)s7) + 3*@)s*O)s*@)s*¥0™) 
2 
= 4. 


Conversely, let f(G’) = 1 and G’ be a torsion group. 

We will prove that f satisfies (K). First we will prove that if there is an element 
b € Gsuch that b” = e, forsomen € Z+ and f(b) = 1, then f(xb) = f(x) for all 
xeG. 

From (CE), we have 


f(xb-b) + f(xb-b7!) =2f (xb) f(b) forx € G; 

that is, 

f (xb*) = 2f (xb) — f@). 
Replace x by xb in the equation above to obtain 

f (xb*) = 2f ab’) — f (wb) = 3f (xb) — 2F(). 

Another replacement of x by xb will result in 

f (xb*) = 4f (xb) —3F(), 
and, continuing in a similar manner, we have, in general, 

f (xb*) = kf (xb) — (k-—1) f(x) forx eG, ke Zy. 


In particular, 
f(xb") = nf (xb) — @— Df; 
hence f(xb) = f(x), forx € G, since b” =e. 
Since f(G’) = | and G’ is a torsion group, 


f(xu) = f(x) forxeG,ueG. 
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For x,y,z € G, 
fey sr yo = F@): 
or changing x by xzy, we get 
f (xyz) = f@zy), 
which is (K). Hence f has the form (3.37). 


This could also be proved as follows [601]. Suppose a and b are in the same 
coset of G modulo G’. Then a = bc for some c € G’ andc” = e for some n € Z+ 
and f(c) = 1. So, f(a) = f(bc) = f(b). The function f therefore factors through 
& (see Remark 3.22) and is thus of the standard form (3.37). 


Solutions of (CE) on a Non-Abelian Group not of the Form (3.37) 


Example 1. Let G be the quaternion group. Instead of writing G = {e,a,a’,a°, 
b, ab, ba, ba”} with a* = e, a2 = b?, ba = ab, we write G = {+1, +i, +), +k} 
with 22? = j* =k* =-1, ij=k, jk =i, ki =j. 

Let f : G > C bea nonzero solution of (CE). We will find all f and show that 
all but one has the form (3.37). There is only one nonzero solution f of (CE) that is 
not of the form (3.37) (see also [163, 170, 429]). 

Set x = 1 = y in (CE). Then f(1) = Oor 1. But f(1) = 0 leads to f = 0. 
So, f(1) = 1. Recall f (xy) = f(yx) and f(y!) = f(y) forx, y € G. 

Let f(-1l)=0, fM=f(i)=4, fM=f-DN=h, fH =f(-/) =>. 
Now x = —1 = y in (CE) gives 2 = 26”, sod = +1. 

First suppose 6 = —1. x =i, y = —i in (CE) yields a = 0. Similarly, 6 = 
0=y.This f (fQ)=1, fEHlD)=-1, fi) =0= f(¢4/) = f(£4)) is not of 
the form (3.37). Indeed, if f has the form (3.37), (K) holds. But f (ijk) = FUE) = 
f(-)) = Land f(ikj) = f(—J) = fQ) = 1. 

Also, if there is a homomorphism g associated to f, then g(x)? = —1 for 
x eG, x +1. But —-1 = g(k)? = g(ij)* = g(i)*g(j)” = 1, acontradiction. 

The case 6 = | leads f to have the form (3.37). For x =i, y = —i in (CE) shows 
that a? = 1. Similarly, 8? = 1 = y?; thatis, f(G) = {1, —1}. By Lemma 3.16a, 
f has the form (3.37) (in this case, f itself is the homomorphism, f (ijk) = 
f(-1) = 1, and f kj) = f(-j?) = fC) = D. 


Remark. In [188], (CE) is considered without condition (K) but with the conditions 
that (i) there exists an e9 € G (group) such that f(e9) = 0 and (ii) f(xyeo) = 
f (xeoy) for x, y € G, where f : G — C, and it is shown that f has the form 
(3.37). 

Conditions (K) and (i) and (ii) are not equivalent. Conditions (i) and (ii) 
imply (K). The example of a quaternion group above illustrates the other way that 
implication is not always true—namely, f(G) = {—1, 1} satisfies (K) but not (i) and 
(ii). 

Now, to show that (i) and (ii) imply (K), first set x and y by xeo and yeo, respec- 
tively, in (CE) to have 
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2 f (xeo)f (veo) = f (reoyeo) + flay!) = fayes) + flay”). 


Then replace x and y by xyeg and eg in (CE) to get f (xyes) + f(xy) = 0. Use 
these two equations with (CE) to obtain f(xy) = f(x) f(y) — f(xeo) f (veo) for 
x,yeEG, 


f yz) = f@)f vz) — feo) f zeo) 
= f(x) f (yz) — f(xeo) f eoy) 
= f(x) f (zy) — feo) f Zyeo) 
= f(xzy), 


which is (K). 


Example 2. (Aczél, Chung, and Ng [40]). Suppose H is the skew field of quater- 
nions over R, and G is the unit sphere in H under multiplication; that is, 


H ={a+bi+cj+dk: a,b,c,d€R}, 
G={xeH: a+b 4c? 4d? = |x|? = 1). 


If we define f : G > Cby f(a+ bi + cj + dk) = a, then f is a solution of 
(CE), f(xy) = f(yx) for x, y € G, but (K) is not satisfied in general, for f (ikj) = 
f(-j- Jj) = land f(@jk) = f(k-k) = f(-1) = -1. So f cannot be of the form 
(3.37). 


Example 3. [601]. Let G = (a,b | a~'b = ba) be a torsion-free group. Define 
f:G— Cby 


f@'b’)=1  if4| 442s, 
fb’) =-1 if2|t+2s; 44442s, 
f(a'b’) =0 inal other cases, 


where f is a solution of (CE) and is not of the form (3.37). Otherwise, we would 
have f(aba~'b~!) = 1. But aba~'b~! = a? and f(a”) = —1, so f is not of the 
form (3.37). 


Example 4. In some special cases, it is not necessary to assume restriction or con- 
dition (K) to obtain the standard form (3.37). We will provide many results in this 
direction. It is necessary to assume certain conditions satisfied by G. 


Theorem 3.23. (Corovei [170]). Let G be a group and F a field with char F 4 2 
and f : G — F be anonzero solution of (CE). Suppose that (i) there is an xo € 
Z(G) (center of G) such that f (xo)? 4 1 and (ii) there is ab € F such that 
b? = f (xo)? — 1. Then f has the form (3.37). 


Proof. (See [424]). The proof in [424, pp. 72-73] works since x9 € Z(G). 
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Let f (xo) = a. First we show that 
(3.44b) [f (vox) f @o) f@)? = LF) LLP? @0)—-1] forx € G, x0 € Z(G) 
(cf. (3.44a)). Replace x by xxo and y by Key in (CE), and use (3.42) to obtain 
FR) + FOG) = 2f @x0)f xg '); 


that is, 


Fy? + Fo)? —1 = fexo) fexq) 
= f (xxo)[2f () f (to) — f x0) 
= —[{f(xxo) — fx) fo}? — f&)? f o)7] 


or 


Lf (exo) — f(x) fo)? = Fx)? fo)? — fF)? — fo)? +1 
= [f (x)? — ULF (0)? — U1, 


which is (3.44b). 
Define g: G > F* by 


1 
we) = SO) + FLf x0) = ff Co)] 
= FU F(ex0) + (b—a)f(x)] forx eG. 


Utilizing (3.42), we have 


1 
g(x) — FOP = LF @x0) — F@)F@0)P 
1 
= gal f )’ — DF Go)" — 0); 


that is, 
g(x)? —2g(x) f(x) +1=0. 


Now we can conclude that g(x) ~ 0 and f has the form (3.37). The only thing 
remaining is to show that g is a homomorphism. 

Instead of using condition (K), use x9 € Z(G). Then, as in Theorem 3.21 (see 
also [429]), 


fox) f(y) + f@oy) fx) = foxy) + a[2f x) fO) — fQry)] 
f ox) f oy) = af woxy) + fa) fy) — fay) 
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and 


1 
8(x)a(y) = Fal f ox) Froy) + & — af) Foy) + FO) F0x)} 


+ (b- a) f(x) FO)I 
= g(xy). 


This proves the result. 


Result 3.24. [170]. Let G be a nilpotent group whose elements are of odd order and 
F a quadratically closed field with char F 4 2. If f : G — F is asolution of (CE), 
then f has the form (3.37). 


Definition. A group G is said to be 3-rewritable if xx2x3 = X(1)Xo(2)X(3) for 
some g, where @ is a nonidentity permutation on {1, 2, 3}. 


Definition. A group G is said to be metabelian provided G’ C Z(G). 


Definition. Let f : G — F, G bea group, F be a field, and S be a subset of G. 
Then f is called an S-odd function provided f (xy~!) = —f (xy) forall x, y € S (a 
generalization of odd function). 


Result 3.25. [174]. Let G be a 3-rewritable group and F a quadratically closed 
field with char F 4 2. A mapping f : G —> F isa solution of (CE) if and only if 
(i) f has the form (3.37) or (ii) there exists a subset S of G such that H = G\S isa 
subgroup of G and f is an S-odd function such that f|H is a homomorphism of H 
into F. 


Result 3.25(a). [172]. Let G be a metabelian group and F a quadratically closed 
field with char F # 2. The function f : G — F satisfies (CE) if and only if (i) f 
has the form (3.37) or (ii) there exists a subset S of G such that H = G\S is a 
subgroup of G, f is an S-odd function, and f |H is of the form (3.37), where g is a 
homomorphism of H into F*. 


Result 3.26. [601]. Let f : G > C, be a solution of (CE), where the group G is 
generated by two elements a and b. Suppose that f (aba~'b~!) = 1. Then f has the 
standard form (3.37). 


3.4.1 Discussion 


Our results permit us to compute in detail all solutions of (CE) on an arbitrary 
Abelian group G. A function f on G satisfying (CE) satisfies (K) if and only if 
f is of the form (3.37). Now, g can be written in one and only one way in the form 
g = Xexp(g), where ¥ is a character of G [369] and g is a homomorphism of G 
into the additive reals. Further, if g is continuous, then so are ¥ and g. 

Indeed, since g is a homomorphism into C*, g/|g| is a character of G, say 
X. If g is continuous, then so is V. Now let g = log|g|. Then plainly enough » 
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is a homomorphism of G into the additive reals, called a real character [369], and 
|g| = exp(@). Further, it is evident that g is continuous when g is. Hence, we have 
g = X exp(g). So, computing g’s (or f’s) is reduced to computing characters and 
real characters of G. 

Let G be a locally compact Abelian group. The homomorphisms g of (3.37) 
all have the form ¥ exp(g), where V is a continuous character of G and @ is a 
continuous real character of G (in the sense of [369, p. 389]). The group G has 
the form R” x Go, where Go contains a compact subgroup (open) Jo and n is any 
positive integer [369, p. 389]. The form of g on R” is obvious, and ¢ is identically 
zero on every element of G some power of which lies in Jo. Otherwise g on Go 
is easily described by a Zorn’s Lemma argument since the image group R of @ is 
divisible. 


Example. Let 1 <n < oo. Let f be a continuous solution of (CE) on R"”, where R” 
is a topological space that is a Cartesian product of the reals R. By Lemma 3.1 6c, it 
is enough to know the homomorphisms on the component groups; that is, on R with 
the usual topology. So if f is a continuous solution of (CE) on R, then f/@Q is also a 
continuous solution of (CE) on Q, where Q is the additive group of rational numbers. 
The continuity of f ensures the continuity of the corresponding g on Q satisfying 
(3.37). Further, g(x) = g(1)* with x in Q. So |g(x)| = |g(1)|* = exp(ax), where 
a = log|g(1)| (/g| is real and greater than zero). Now g/|g| is acontinuous character 
of Q and therefore has the form exp(ifx) with f real (see [369, p. 414]). So g(x) = 
exp[(a+if)x]on Q. f = (g+g*)/2 on Q and therefore on R. Hence, all continuous 
solutions f of (A) on R have the form f(x) = cosh(ax), where a is any complex 
number. Now applying Lemma 3.16c, we conclude that every continuous solution 
of (CE) on R” is given by 


&(X1,X2,.--,Xn) + B* (XI, X2,..-, Xn) 


f(1,%2,...,Xn) = 5) ’ 

where 
n 
2(x1,%2,---,Xn) = | expl(Ap + 1B p)xp] 
p=1 
n 
= exp Sl@p + iBp)xp] 
p=1 
So 


n 
Ff (x1, %2,.--,%Xn) = cosh > & + iBy)Xp 
p=! 
This generalizes Flett’s results and earlier ones [147, 280, 439]. 
There do exist unbounded, discontinuous solutions of (CE). Consider the 
p-adic integers A, [369]. A, > Z™, where c is the continuum. Since there are 
many unbounded, discontinuous real characters of Z and each can be extended to 
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a real character on A p, it is evident that there are many unbounded, discontinuous 
solutions of (CE) on Ap. 


3.4.2 (C) on Abstract Spaces 


The equation (C) on abstract spaces such as Hilbert space, Banach space, and Banach 
algebra have been treated. We will consider some of them (see [581, 582, 118, 94, 
621, 409, 44, 577, 574, 760, 772]). 


Result 3.27. (Kurepa [581]). Let X be a Banach space and f be a complex-valued, 
continuous function on X satisfying (C) with f(O) = 1. Then there exists an A : 
X — C that is continuous and additive (A) such that f (x) = cos A(x) for x € X. 


Corollary 3.27a. Suppose X in Result 3.27 is a Hilbert space. Then f(x) = 
cos(x, xo) for a unique x9 € X dependent only on f. 


Result 3.27b. [581]. Let My, be the set of alln x n matrices over R and f be a 
complex-valued, continuous function on My, such that f is a solution of (C) with 
f() = Land f(s~!xs) = f(x) for all x € My and for every nonsingular matrix 
s € My. Then f(x) = cos(a tr x) for x € My, where a € C and tr x = trace of 
x = sum of the diagonal elements of x. 


Result 3.28. [582]. Let X be a real or complex Banach algebra with unit e and 
f :R— X bea measurable solution of (C) with f(0) = e. Then there exists a 
unique a € X such that 


2 2,4 CO n,.2n 


ax ax ax 
(3.48) A aa iar ee 7) forx ER. 


If in X there is a b such that a = b? (that is, a has a square root), then 


_ > (bx)2" __ expbx + exp(—bx) _ : 
(3.48a) f(x) = pa cn an a cosh bx; 


that is, f has the form (3.37). However, if b exists, it is not unique. 


Generally the square root of a does not exist in X, and the solution (3.48) cannot 
be written in the form (3.48a). It is illustrated by the following example. 


Example 3.28a. [582]. Let X = M2, 2 x 2 matrices over C. Let f : R > Mp be 


defined by f(t) = (: 


i) for t € R. Clearly f is a solution of (CE) and is not of 


the form (3.48a). Indeed, in this case a = f”(0) = € 5) and there is no b € M2 


such that b? = a. On the other hand, consider @ € M4, where 
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Then 


This example suggests the idea of imbedding the Banach algebra X in another 
Banach algebra X that has the property that any a € X as an element in X has a 
square root in X: ive., there is at least one element b € B such that b? = G. The 
construction of such a Banach algebra X is very simple. It is sufficient to consider 
all 2 x 2 matrices <, ) the elements x;;, yi; (i, j = 1,2) of which are elements of 
X and to define the usual matrix operations between such matrices. Introducing the 
norm in X by the formula 


2 
ll = >) ball, 
ij=l 


one easily verifies that X is a Banach algebra. In the Banach algebra X, we imbed 
(isomorphically but not isometrically) the Banach algebra X by the corresponding 


a>a=(5°) forae X. 
Oa 


Now, simple calculation shows that 


(aan CAy) =(02) 


for every a € X; i.e., in X, every element a € X has a square root. 


The function 
fi 140) 2 
iO. ey) 
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satisfies all conditions of Result 3.28, and 


a, (ft) 0 \_ e+ § é=G ) 
FO= (70) peo) =e (Cy fel 


holds for every t, where the hyperbolic cosine is defined by the series. Thus we have 
the following result. 


Result 3.28b. [582]. Let f : R — X (a Banach algebra with unit e) be a measurable 
solution of (CE) with f (0) = e and let X be a Banach algebra of all 2 x 2 matrices 
the elements of which are elements of X and for which the norm of an X € X is 
defined by the formula 


2 
IZ = So lal. 


i,j=l 


If we imbed X in x by the correspondence 
a>|(* 0 
Oa)’ 
© (t) 0 


0 5) = [expta + exp(—ta)]/2, 


then 


for every t € R, where a can be taken as 
G = 2 
a ee C4 ) forae xX, 
long) = 
and it does not depend on t. 


Baker [88] has shown with an example that the measurability assumption in 
Result 3.28 [582] is necessary. First we give the following result. 


Result 3.28c. [88]. Suppose G is an Abelian group, X a complex Banach algebra 
with identity e, and f : G — X a solution of (C). If, for some a € G, the spectrum 
of f 2a), o(f (2a)) is contained in a simply connected open subset of C that does 
not contain | or || f (2a)|| < 1, then f has the form (3.37). 


Example 3.28a of Kurepa can be generalized as follows [88]. Suppose N is 
nilpotent in X of order 2(N 40, N? = 0), and let g : R > R satisfy (Q). It is easy 
to verify that f(t) =e + q(t)N, fort € R, satisfies (C), where f : R—> X. 


Theorem 3.28d. [88]. Let N be nilpotent of order 2 in X, q : R > R satisfy (Q), 
and B : R x R= R be a symmetric, biadditive function such that q(x) = B(x, x) 
for x € R (see Theorem 4.1). The function f : R > X defined by f(t) =e+q(t)N 
satisfies (C). Then f has the form (3.37), provided there exists K : R — [0, +00) 
such that 


(3.49) |B(s,t)| < K(s)K(t) foralls,t ER. 
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Proof. Suppose E : R — X is exponential and such that 


E(t) + E(-t) 


forallt € R. 
2 


fO= 


Define S : R > X by S(t) = EOTECS for t € R. Then 


E(t) — E(— 
S(9)S(0) = [(6) — EH O- A 

_ Ee) =—£6=)=Ees4t)+£es =f) 
= 
_ f6t+)-fe-1 
~ 2 
=I[q(s+t)—q(s —1)] 


=2B(s,t)N (by (4.2)) 


N 
2 


and hence 
ISI 
2 


B(s,t) < ||S(s)I IN| foralls,r eR. 


The proof is complete if we let K (t) = aBAtE fort € R. 

In [207] it is shown that there exists a symmetric, biadditive B : Rx R — Rsuch 
that there isno K : R > [0, +00) for which (3.49) holds. Now let g be the quadratic 
function associated with such a B and let f(t) = e+ q(t)N fort € R, where N is 
nilpotent of order 2 in X. From Result 3.28d, it follows that it is impossible to embed 
X in a larger algebra X in sucha way that f could be of the form (3.37) from R to 
Xx. 

If the function K of Result 3.28d exists, it is not known if X can be embedded in 
a larger algebra X so that f (t) is an exponential cosine of the form (3.37). However, 
the following partial converse is true. 


Result 3.28e. Suppose B : R x R — R is symmetric, biadditive, and such that 
(3.49) holds for some K : R — [0,+00), and let q(t) = B(t,t) fort € R. Then 
there exists a Banach algebra X and nilpotent N of X such that f(t) =e+q(t)N, 
for t € R (an exponential cosine function), has the form (3.37). 


Result 3.28f. [88]. Suppose X is a Banach algebra and f :\0, co[—> X satisfies (C) 
whenever x > y > 0. If i f(t) = j exists, then j* = j and there exist b,c € X 
t>0+ 


such that jb = bj = b, cj =c, jc =0, and 


3.48b | ie x2b  x4b? — x8b  x5b? 
(3.48b) f@)= Ito, t4qpr +e a rt 


for all x > 0. Conversely, with such j, b, and c, if f is defined by (3.48b) for all 
x > 0, then f satisfies (C). 
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Result 3.29. [582]. Suppose X is a Banach space and Y a Banach algebra with 
identity e, that f : X — Y satisfies (C) with f(O) = e for allx, y € X, and that 
f is measurable on every ray; i.e., f (tx) is measurable as a function of t € R for 
every x € X. Then there exists a quadratic function q : X — Y satisfying (Q) such 


that 
[o@) 


f@M= rs pe forx €X. 
5 ! 


Further, q is continuous if and only if im f (tx) = e uniformly for x in some sphere. 
t> 
Tf \le — q(xo)|| < 1 for at least one x9 € X, then an additive A: X — Y exists such 
that q(x) = [A(x)f- and therefore 
1 
fG&) = slexp(A()) + exp(—AG))] = cosh A). 
If q is continuous, so is A. 


Result 3.30. [577]. Let H be a Hilbert space and L(A) the set of all linear con- 
tinuous mappings of H into H endowed with the usual structure of a Banach space. 
Let N : R > L(A) satisfy (C) for x,y € R. Suppose that (i) N(x) is a normal 
transformation for every x € R; (ii) if N(x) f = 0 almost everywhere, then f = 0; 
and (iii) N(x) is weakly continuous. Then a bounded self-adjoint transformation L 
and self-adjoint transformation L2 that commutes with L, can be found such that 


N(x) = slexp@x™) + exp(—ixN1)] = cos(x N1) 


holds for all x where Ni = L2 + iL}. 


More on Cosine 


Theorem 3.31. (H. Haruki [352]). Suppose f : C — C is an entire function and 
Satisfies 


(3.50) FOMFE+wv)+ fE—w)) =2f@fWw) forz,wec. 


Then there is an entire g : C + C such that 


[FOP + wi? +f — w)I7] 
=2/f@7IFWw)? +2lg@Pigw)? forz,weC 


and conversely. Further, the only system of entire solutions of (3.51) is f(z) = 
acosbz, g(z) = aexp(i@) sinbz, where a and b are arbitrary complex constants 
and @ is an arbitrary real constant. 


Proof. Let f satisfy (3.50). We will prove that (3.51) holds. We may assume that 
f(z) # constant. Otherwise the proof is clear. Note that f(0) 4 0. Otherwise 


f=0. 


(3.51) 
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Differentiating both sides of (3.50) twice with respect to w and putting w = 0, 
we have 


(3.52) FOS @) = f" OF). 


Differentiating both sides of (3.50) with respect to z and w, we have 
(3.52a) FOF" G+ wv) — f"@ — w)) = 2f' Fw). 


We can deduce that f”(0) 4 0. Otherwise, by (3.50) and (3.52), f(z) is a 
complex constant, contradicting the assumption that f(z) 4 constant. 
By (3.52) and (3.52a), we have 


FO) 
Ff") 


Equations (3.50) and (3.52b) and the parrallelogram identity |a + b|? + |a— b|? = 
2\a|? + 2|b|? (a, b complex) yield that 


(3.52b) fOVMFE+w)— f@—-w))=2 F@)F'(w). 


IFO?AFG+w)P +1f@ — w)I*) 


(3.51a) = 2 2 f(O) ! 2) ¢/ 2 
= 2 fF (Ww) +2 #70) IP OVIF WII. 


By (3.51a), we see that (3.50) implies (3.51) with g(z) = ait f'@), which is 


an entire function of a complex variable z. This proves the theorem. 


For the converse, (3.51) implies (3.50) (see [352]). By (3.50) and Remark 3.12 
there follows f(z) = acos bz and then g(z) = a exp(i@) sin bz. 

Let f : C > C be an entire solution of (C). Setting x = sot and y = 
(C), where s and ¢ are reals, yields 


af ( . *) f ( — =f 24). 


s—it 


7 In 


If f(z) = cos z in the equation above, by 


(=) = cos (*+*) = cos (*=*) = (=) 
we obtain 
of ( =) r(° =") = f(s) + fit) 


2 
s+it 
"() 


and so 
* _ f+) 
2 
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and then 


(3.53) | f (=) 


2 
fors,t ER. 


= fox f(it) 
2 


Thus we obtain a new cosine functional equation (3.53). 


Result 3.32. [357]. Suppose that f : C — C is an entire function. Then the 
only entire solutions of (3.53) are f(z) = 0, f(z) = exp(i@)cosaz, f(z) = 
exp(i@) coshaz, where 0, a are real constants. 


3.4.3 Jacobi’s Elliptic Function Solution 


Consider the following initial value problem, where p,q, 7 are meromorphic func- 
tions of a complex variable z for |z| < oo and k is an arbitrary real constant: 


P'@) =4(2)r@), a'@) = —P@r@), r'@ = —k’ p@a@), 
p(0) =0, ¢@) =r(0) = 1. 


A unique system of solutions for the initial value problem above is given in [375, 
pp. 66-75]. We write s,(z;k), cn(z; k), dn(z; k), respectively, for p(z), g(z), 
r(z). They are the three kinds of Jacobi’s elliptic functions. 

Suppose f : C — C satisfies (C). Replace x and y by x + y and x — y, 
respectively, in (C) to get 


fQx)+ f@y)=2fat+y)f@—y) forx,y eC, 


which with the use of the duplication formula (3.23) becomes 


fay + foY =fa+tyfa-y4+1 


(e.g., (3.35)), and thus we obtain the functional equation 


(3.54) [f@e+y)+ f@—y IL OY + £OY1 = 2f FOF A+y)f@—-y) +1). 


Result 3.33. [355]. Jf f is a meromorphic function of a complex variable z for 
|z| < ©, then the only solutions of (3.54) are f(z) = 0 and f (Zz) = cn(z; k), where 
a and k are arbitrary complex constants. 


Theorem 3.34. (Ger [308, 306, 307], Aczél and Dhombres [44]). Assume G is an 
Abelian, connected topological group. A function f : G — R is the real part of a 
continuous character if and only if f is not an identically zero, bounded, continuous 


solution of (C). 


Proof. Recall that a function V : G > Cis called a character, provided X (x+y) = 
X(x)X(y) for all x, y € G and |X(x)| = 1. 

Let ¥ be a continuous character of G. Then clearly ¥(0) = 1, and ¥ is bounded 
and ¥(—x) = X(x). So, if we set f(x) = Re¥ (x), then f(x) 4 0 and we get 
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fe +y)+ f(x —y) = ReV(x + y) + Re¥ (x — y) 
= Re[X(x)X(y) + VX)e(-y)] 
= Re[X(x)(¥(y) + X0))] 
= 2ReV(x)-ReV(y) =2f(x)f(y) forx,y EG. 


So, f satisfies (C). 
Conversely, let f : G — R satisfy (C) and be both bounded and continuous. 
Then, according to Theorem 3.17 and Theorem 3.18 [424, 477], 


1 1 
roy = 5 [20 + 25] forx €G, 
where E : G > C* is acontinuous (homomorphism) exponential function satisfying 
(E). Obviously, since f is real, we have 


1 
E(x) + Ew = E(x) + AG 


and since E is unique by Lemma 3.16 and E is also such a homomorphism, we must 
have either E(x) = E(x) or E(x) = = on G. 

In the first case, E is real and we shall show that E(x) = 1. In fact, E being 
continuous and nonvanishing at any point, it has to be of constant sign since G is 
connected. Suppose that E(x) > 0 forx € G. Since f = s(E + z) is bounded, this 
implies that E is also bounded. If there is an x9 with E(xo) € 1, we may assume 
that E(xo) > 1 (otherwise take —xo instead of x9). Then E(nx9) = E(xg)"” > co 
asn — oo, acontradiction. Thus E(x) = 1 = f(x) = Re E(x), and E(x) isa 
continuous character. 

In the other case, |E(x)| = 1 for all x € G, whence 


fQx)= 5(E) + E(x)) =ReE(x) forx €G. 


Thus, in both cases, f(x) = ReE(x), where FE: G > {z € C: |z| = l}isa 
continuous homomorphism. This ends the proof (due to Ger [307]). 


A few remarks are in order. 


Remark 3.34a. (Ger [308]). The assumption of the connectedness of G may be re- 
placed by the requirement of 2-divisibility of G and the assumption that G is Abelian 
by the assumption that f satisfies (K). In fact, the only place connectedness is used 
is to show that E is positive, which can be derived from 2-divisibility as 


l by 
Bi) =E(2-5x) = £ (5x) >0 forxEeG 


(5x need not be unique). See below in Remark 3.34c that connectedness is also not 
necessary. 
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Remark 3.34b. [308]. O’Connor in [652] proved the converse part under the addi- 
tional strong assumption that G is locally compact. His proof relies on rather heavy 
tools from abstract harmonic analysis. The proof above shows that there is no need 
to assume local compactness of G (see also Ljubenova [610)]). 


Remark 3.34c. (Kannappan [419]). A function f : G — C is nonzero, bounded, 
and satisfies (C) and (K) if and only if there is a character ¥ : G — C such that 
f (x) = ¥(x) for all x € G, where G is a group. 


Proof. Suppose f satisfies the hypothesis. Then g given by (3.41) and Theorem 3.21 
is a homomorphism from G into C*. Since f is bounded, so is g by (3.41) and 
7 by (3.37). Hence |g(x)| = 1 for all x € G. Indeed, if |g(xo)| > 1, consider 
lg (nxo)| = |g(xo)|" — ov, a contradiction to g being bounded. If |g(xo)| < 1, 


consider | — ooasn — oo. Thus g is a character of G. Hence 


_l_ 
g(nxo) 


1 — 
f@=- 5 [20+ re = = qlatx) + g(x)] = Re g(x). 


The converse is obvious (see the proof of Theorem 3.34). 


Remark 3.34d. [419]. Suppose a nonzero f : G — R satisfies (CE) and (K). Then 
either there is a character ¥ : G — C such that f(x) = Re V(x) forall x € G 
or there is a homomorphism g : G — {1, —1} and an (additive) homomorphism 
€:G— R such that f(x) = g(x) cosh(€(x)) for x € G. 


Proof. By Theorem 3.21, there is a homomorphism g : G — C such that (3.37) 
holds. Let u(x) = Re g(x), v(x) = Im g(x). Then 


1 J 
2f (x) = u(x) [ - rae + tv(x) E - nd 


Since f is real valued, either v(x) = 0 or u(x)? + v(x)* = 1; that is, either 
(3.55) Im g(x) =Oor|g(@*)|=1 forx EG. 


First we will prove that |g(x)| = 1 for all x or Im g(x) = 0 for all x. Suppose 
there exist x9, yo € G with |g(xo)| 4 1, Im g(yo) 4 0. Then, by (3.55), Im g(x) = 


0 and |g(y0)| = 1. 
Using g as a homomorphism, first we get |g(xoyo)| = |g(xo)| 4 1. Now claim 


Im g(xoyo) # 0. If Im g(xoyo) = 0, then u(xo)v(yo) = 0. Since v(yo) 4 0, 
0 = u(xo) = g(xo), which is a contradiction for g : G — C*. Hence neither 


lg (xoyo)| 4 1 nor Im g(xoyo) = 0. 
Thus |g(x)| = | for all x or Im g(x) = 0 for all x € G. In the case |g(x)| = 1 


for all x, g itself is a character of G and f(x) = 5 [g@) + g@)| = Re g(x). 
In the case of Im g(x) = 0, g is a real homomorphism of G into R*. Set g(x) = 
at and ¢(x) = log |g(x)|. Then g is ahomomorphism of G into {1, —1}, 
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é(xy) = &(x) + &(y), 
g(x) = p(x) - expe (x). 


So, for x € G, 
FO) = 5100) expelx) + Lo) expe) 
= ox) slexpelx) + exp (—E(x))]9@) cosh Ete). 
Now it is easy to deduce that the following are equivalent when f : G —> R satis- 


fies (CE) and (K): | f| => 1; f is unbounded or | f(x)| = 1; f(x) = g(x) coshé(x) 
for all x € G, where gy : G > {—1, 1} isa homomorphism and € is additive on G. 


Result 3.35. [419]. Let H be a dense subgroup of G and f : H > C bea 
continuous solution of (C) with (K). Then f has a unique continuous extension 


f*:G—>C which satisfies (C) and (K). 
Proposition 3.36. [419]. The following are equivalent on G: 


(i) For f : G > C satisfying (CE) and (K), there exists an (additive) homomor- 
phism yn: G — C(+) such that f (x) = cos n(x) for x € G. 
(ii) For any homomorphism g : G — C*, there is a homomorphism € : G > C 
such that g(x) = exp (x) for x € G. 
(iit) For any character X : G — C, there is a real homomorphism € : G + R such 
that X (x) = expié(x),x € G. 


Proof. To prove (i) = (ii), suppose g : G — C* is ahomomorphism. Now, define 
f(x) = slg) + AD! Then /f satisfies (CE) and (K). So, by (i), there is an 
(additive) homomorphism 7 : G — C such that 


2 


f(x) = cos y(x) = forx €G. 
Since, by Lemma 3.16, a homomorphism associated to f is “unique”, expin(x) = 
g(x) or exp (—in(x)) = g(x) for x € G. This proves (ii). 

Next, to prove (ii) = (iii), we are given acharacter V : G —> C. By (ii), there is a 
homomorphism é : G > C such that V(x) = expié(x), x € G. Since |V(x)| = 1, 
&(x) is real. This proves (iii). 

Finally, to show that (111) = (4), let f : G — C satisfy (CE) and (K) and be 
nonzero. By Theorem 3.21, there is a homomorphism g : G — C* associated to 
f. Set V(x) = oe and g(x) = log |g(x)| for x € G. Then ¥ is a character, g is 
a real (additive) homomorphism on G into R, and g(x) = V(x) expg(x). By (ii), 
there is a real homomorphism € : G > R such that ¥(x) = expié(x) forx € G. 
So, 


g(x) = exp p(x) - expi¢(x) = expin(x), 
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where 
n(x) = —i p(x) +o). 
Thus f(x) = cos 7(x). This proves (i) and completes the proof of this proposition. 


If G is a topological group, each of the functions involved above can be taken to 
be continuous. 


Characterization of the Cosine and Some Generalizations 


Since Cauchy equations possess noncontinuous solutions, (C) has non-(measurable) 
continuous solutions. In order to obtain the cosine solution, it is necessary to assume 
some regularity condition on /. 

As is known, the definition of the trigonometric cosine may be given using the 
theory of representation of groups. Such an approach is nearer to the introduction of 
the cosine by functional equations of two variables. The latter one can be obtained 
by investigating the equation (C), 


(3.56a) fat+y) = f@)FO)-y1—- f@)PY¥1—- £0), 


for instance, and more; this equation has in the class of continuous functions the 
solutions f(x) = cos dx and only these. The characterization is done also by using 
functional equations in a single variable (Kannappan [423], Kuczma [554], Kulkarni 
[568], Sharkovski [733]). It seems that the best way of characterizing the cosine (as 
well as the other functions) is by several (and not one) functional relations. We will 
see some in a later section. 

Let f and g be functions satisfying 


(3.56b) fx —y) = f@)fO) + g)gQ) 
and 

lim g(x) =1. 

x>0 X 


Then f(x) = cosx and g(x) = sinx. We will determine the general solution of 
(3.56b) later in this section. If, however, one wants to define cosine alone, one may 
use the equation (3.56a), whose general continuous solution is f(x) = coskx [12, 
314]. Equation (3.56b) or (3.56a) certainly characterizes the cosine function. But it 
would perhaps be desirable and at least interesting to characterize the cosine by an 
equation in a single variable. In this direction, the functional equations 


(3.23) fQx) =2f (x)? -1 


and 


i 
(3.23a) f (5) = s@j Or 
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where 


Gy ft forse [4k —1), 4k + DL, 
ow Nt “fore € 1G 1 4ES 3), FS, 21, 2, 


can be considered. 

It has been proved that f(x) = cos x is the only even solution of (3.23) that 
is twice differentiable at x = O and such that f(0) = 1 and f”(0) = —1. Also 
f (x) = cosx is the only function defined for all x, continuous in a neighbourhood 
of x = 0, satisfying (3.23a) and periodic with period 27. These results characterize 
the cosine. 

In (3.23), it would be nicer to be able to introduce the cosine before introducing 
the derivatives, with no stronger means than, for example continuity. But it has been 
proved that the condition that f is twice differentiable cannot be weakened, even 
under the additional assumption of continuity. In (3.23), it contains the sign factor, 
which is somewhat disagreeable. 

First we give a characterization of the cosine function with the help of a func- 
tional equation in a single variable based on the duplication formula (3.23) and then 
another characterization related to the parallelogram law of addition of forces. 


Theorem 3.37. [423]. Let f : R — R be even, continuous in a neighbourhood of 
0, f(x) = Ofor0 <x < Sand f(x) < Ofor 5 <x < a, and satisfy 


(3.56) fQx) =1—-2F (> = x) forx eR. 


Then f(x) =cosx forall x € R. 


Proof. First we show that f is periodic with period 27. Then we prove that f is 
continuous everywhere. Change x into x + 4 in (3.56) to get 


f(a +2x) =1-2f(-x) forx ER. 


Change x into —x in the equation above and use evenness of f to conclude that 
f has period 27. The sign of f(x) is determined uniquely for each x in R by the 
periodicity 27, evenness, and the hypothesis f(x) > 0 in [0, 5] and < 0 in [4, z]. 

Rewrite (3.56) as f(x) = 1-2f(5 - )*. As x varies over Ja — €,a + 
eL, (5 — 3) €] — §, §[. This shows that f is continuous at z. When 5 — x varies 
over ]z —€, tm +e[, 2xe ]—a —2e€, —a7 + 2e[ f is continuous in Jz — 2€, a + 2€[. 
Proceeding in this way, we can conclude that f is continuous at all points. Finally, 
we will show that f is determined uniquely at every point. 

As f is continuous everywhere, it is enough to prove that f takes unique values 
at every point x = . k any integer, and n any integer 2 1. The proof is based on 
induction on n. 

Putting x = S in (3.56) and using f even, we get f(z) = 5 or —1. But since 
f(x) < Ofor x € [5,2], we have f(z) = —1. Letting x = 4 in (3.56), we get 


1 
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f (0) = 1 fork an even integer and f(kz) = f(z) = —1, fork an odd integer. Now, 


suppose that f takes unique values for all x of the form x = a. Then, as 


kx\  , /1 1 (2"-!_k)x 
5()=+ 5 a ro )} 


by using the induction hypothesis and the fact that f has a fixed sign at every point, 
we see that f takes unique values at every point x = me Thus f takes unique values 
at every point. But the function cos x satisfies all the conditions of the theorem and 
hence f(x) =cosx for all x. 

Of course, the equation (3.56) can be reduced to 


(3.23) g(2x) =2g(x)?-1, xeER, 


and then the solution of (3.56) can be found using [554] by the following procedure. 
Now define a function g : R — R such that 


gix)=—f(r—x), xeR. 
Then, from (3.56) and using f even, we have 
2g(x)* =2f(@ —x) =1-— f2x—2)=1+gQx), xeER. 


Hence g satisfies the duplication formula (3.23). As f is continuous in a neighbour- 
hood 0 and so in a neighbourhood of z, we see that g is continuous in a neighbour- 
hood of 0. 

Since f is periodic with period 27, we see that —g(x) > O for S$ Sx SF 
and g(x) S$Ofor> SxS r, and g has period 27. From [554, 733] we see that 
g(x) = cosx for all x in R. Hence f(x) = cos x for all x inR. 


Remark 3.37a. f(x) = +5 satisfies (3.56) and f is continuous and even. It does 
not satisfy f(x) < Oin[4, z]. 


Remark 3.37b. f(x) = cos (2k + 1)x, k any integer, is even, continuous, and satis- 
fies (3.56) but not the sign condition. 


The problem of finding out whether these are the only even, continuous solutions 
of (3.56) remains open. 
3.4.4 More Characterizations in a Single Variable 


The function f : R —> R defined by f(x) = acosn(x +b) for a,b real and n an 
integer has the following properties: 


(i) f is continuous and has period 27. 
(ii) f is differentiable, and there exist real constants c and d such that 


f'(«)=cf(«+d) forallx ER. 
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(iii) Given any real constants c, c’, d, d’, there exist real constants d1, d, such that 
cf (x +c) + df (x +d’) = di f (x + d;) forall x. 


Now, we consider the converse case by choosing from properties (i) to (iii) to 
determine f and then connect to the parallelogram law of addition of vectors. 


Theorem 3.38. (Robbins [705]). Suppose f : R — R is continuous with period 
2a; that is, (i) holds. Then, if f has either property (ii) or (ili), then f(x) = 
acosn(x +b) for some constants a, b and an integer n. 


Proof. [705]. Suppose f has property (ii). Then f has derivatives of all orders 
with period 27. So, f can be represented by a convergent Fourier series that can be 
differentiated term by term. Thus, for some constant ky, 


(oe) 
f@= ykn expinx, 
—oo 


[ee 


fees DS iknn expinx. 
—0o 
Then 
foe) 
O= f’(x)-cf« +d) = DknGin — cexpind) expinx 
CO 


yields, for every n, ky (in — expidn) = 0. So, for f # 0 (if = 0 nothing to prove), 
there exists an n for which k, # 0 and in = expidn. Thus n = +d. Hence there 
exists at most two values of n for which k, # 0, and these values are negatives of 
one another. Thus, for some integer n, 


f(x) =k, expinx +k_,exp(—inx) forx €R. 


Since f is real valued, it follows that k_, = ky, and the proof is complete. 

For the proof when f satisfies (iii), see [705], and we apply this part to derive 
the law of addition of forces. 

We shall now apply Theorem 3.38 (iii) to derive the law of addition of forces. 
For simplicity, let us consider only forces acting at a fixed point in a fixed plane in 
which the angular coordinate x is defined. With such a force, we identify the real- 
valued function F (x) that specifies the scalar component of the force in the direction 
x; thus a force is represented by a real-valued function of period 27. By the sum of 
two forces Fj (x) and F2(x), we mean the function Fj (x) + F2(x). Our assumptions 
are the following. 


All forces are geometrically similar. By this we mean that there exists a fixed function 
J (x) of period 27 such that any force F(x) can be written in the form 


(3.57) F(x) =ar f(x +bp), 
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where af and bf are constants determined by F(x). We need not assume that all 
values of the constants af and bf can occur in (3.57), but we shall assume that there 
exist at least the forces F,(x) = f(x) and Fo(x) = f(x + 1). 


The sum of two forces is a force. This implies that the function f (x) has the property 
that, for certain constants c, d and for every x, 


(iv) fe +1) + Ff) =cf@ +d). 


The function f (x) is continuous, nonconstant, and vanishes for at most two values 
in the interval0 < x < 2z. 


Theorem 3.38 (iii) shows that the function f(x), continuous, real valued, of 
period 27, and satisfying (iv), must be of the form 


f(x) =acosn(x +b), 


where n is an integer. The hypotheses of (iii) ensure that n can be chosen as 1. The 
parallelogram law of addition of forces is an immediate consequence. 


Now we treat d’Alembert’s restrictive functional equation. 


Let X be areal pre-Hilbert space with dim X > 3. In the class of real functionals 
f : X — R in [Fi1] three functional equations were considered: (C) and two 
conditional forms of (C), namely 


(Ci) f+y)+f@-y)=2FG)FO), {forx,y eX: (x, y) = 0}, 
(Co) f@ty)+f@-y)=2F@)fO), f{forx,y eX: lll] =Ilyll. 


Now we compare the class of solutions of (C;) or (C2) with that of (C). It is easy 
to see that functionals f : X — R defined by f(x) = exp A(||x||") forx € X, 
where A : R — R satisfies (A), are a solution of (C;) but not (C). So, the class 
of solutions of (C) is a proper subset of the class of solutions of (C;). But with the 


addition of a suitable condition on /, the two classes of solutions will be identical. 


Theorem 3.39. [281]. A nonzero function f : X — R is a solution of (C) if and 
only if f satisfies (C1) and the duplication formula (3.23). 


As regards (C) and (C2), the following is true. 


Theorem 3.40a [281]. A nonzero f : X — R is a solution of (C) if and only if f 
satisfies (C2) with f (0) = 1. 


Proof. It is obvious to see the necessary condition. Now, to prove the sufficient 
condition, assume f satisfies (C2) and f(0) = 1. Obviously, the duplication formula 
(3.23) holds. So, by Theorem 3.39, it is enough to show that (C;) holds. Let x, y ¢ X 
with (x, y) = 0. If ||x|| = |[y||, then (C1) holds. 


Suppose ||x|| yi lIyll with (x, w= = 0. Then, since IIx nian = ||x — y|| and 
x+y x-y 
IF = 1% = 


2. 
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f(2x) + fy) =2f(% + yf —y), 


fx) +f) =2F (=) r (: - *) 


Squaring the latter equation and using (3.23) in the first equation, we get 


2 = 2 
1+ f+ fee —y) +2Fe) 0) = 4s (2) (=) 


But if f satisfies (3.23), then 


2 2 
x+y x —y 
4r( ; ) r( 5 ) = f+ fe—y)+fe+y)+f@-y)t1 
(see Fochi [281]). Thus (C) holds. 

Now we determine the general solution of (3.27) [12, 830, 426, 477]. Let f,o : 
G — F, with G a group, satisfy 


G27) fa +y)+ f@—y)=2f()e(y) forx,y €G, 


with f satisfying (K). We have to modify the proof in Theorem 3.17 (where f, @ : 
IR > C are continuous) to find the general solution. 

When f = 0, @ is arbitrary. When g = 0, f = 0. We consider now nontrivial 
solutions of (3.27), f #0, and g ¥ 0 and prove the following theorem. 


Theorem 3.41. The most general solutions f,9 : G > C satisfying (3.27) with f 
satisfying (K) are f = 0, 9 arbitrary; 9 = 1 and f is Jensen’s equation given by 
(3.58); and v(x) given by (3.59) and f by (3.60). 


Proof. From Theorem 3.11, it follows that g also satisfies (K) and (C). 
When g = 1, f satisfies (J) and 


(3.58) f(x)=A(x)+b forx eG, 


where A : G > Cis additive. 
Let 9 € 1. From Theorem 3.21, it follows that 


_ 8) +8") 


(3.59) p(x) 5) 


forx € G, 


where g is a homomorphism G to C*. 
Set x = 0 in (3.27) to obtain 


(3.60) f(y) + f(y) = 2ke(y) =k(g(y) + 8*Q)) fory eG. 
Interchange x and y in (3.27) and add it to (3.27) and use (3.60) to have 


2fx+y)+ fx —y)+ fy —x) =2f(x)e(y) + 2fO)e@); 
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that is, 
2f(x+y)+k(g@—y)t+ 8° @—y)) = FA)EO) + 8°O)) + FOE) + 8*@)) 
or 


k 
fa@ty)- zs +y)+a*@+y)) 


(vy) + 8°) 
2 


bx) eo") 
2 > 


k 

= Ee = pis): 0) 
k 

+ Ee = 5 (80) + “| 


meaning 


(e(y) +8*0)) (g(x) + g*(x)) 
2 2 : 


(3.61) Fa@+y)= F(x) iy) x,yeG, 


where : 
F(x) = f@)- 5 (g) + g*(x)) forx eG. 


Apply associativity and use (3.61) to have 
1 
F(x ty +2) = FIF@)8O) + 8°O)) + FOVEO) + 8° O)IG@) + 8°@) 
1 
+ FQ): se +) +8" + y)) 


1 1 
= F(x)- 5 (8 +z)+a*(ytz))+ glFOVE@) + g*(z)) 
+ F(z)(g(y) + 8° O))(g@) + 8*(@)), 


which on simplification becomes 
F(x)(g(y) — 8*(y))(8@) — 8° @)) = F@)(8@) — 8°@))(@O) — 8*0)). 


Now g(x) = g*(x) for every x means g(x) = 1. Indeed, g(x)? = 1. If there is an 
xo with g(xo) = —1, then —1 = g(x0) = g (22)? = 1, acontradiction. 
Then g(x) = 1, which is not the case. Thus 


F (x) = (g(x) — g*(x)) 


and 


k 
(3.59a) FC) = se) + g(x) + b(g@)— g*(x)) forx eG. 


This proves the theorem. 
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Result 3.41a. (Carmichael [143]). Let G be a group whose elements are of odd 
order, Z(G) the centre of G, F a field with char F 4 2, and F(b) the extension 
field of F by an element b. Suppose f,g : G — F satisfy (3.27). Suppose there 
isau € Z(G) such that p(u) 4 1. Then o and f are given by (3.59) and (3.59a), 
respectively, where g : G — F(b)* is a homomorphism. 


Definition. A group G is said to be a P3-group if the order of G’ (the commutator 
subgroup of G) is either one or two. 


Definition. Let G be a group, F a field with char F 4 2, and C a subset of G. 
Then f : G > F is called a C-odd function if, for all x € C and y € G, f(xy) = 
—f(xy7'). f is said to be C-even provided, for all x € G and y € C, f(xy) = 
fay"). 


Result 3.41b. (Corovei [171]). Let G be a P3-group and F a quadratically closed 
field with char F 4 2. The pair of mappings f,@ : G — F is a solution of the 
equation (3.27) if and only if it has one of the following four forms: 


(i) f =0and ¢g is arbitrary. 

(ii) f(x) is given by (3.59a) and @ is given by (3.59), where g is a homomorphism 
from G into F*. 

Gii) f(x) = g(x)[g(x) +4], g(x) = g(x), forall x € G, where g is a homomor- 
phism from G into F*, y is a homomorphism from G into the additive group of 
F, and a is an arbitrary element of F. 

(iv) There exists a nonempty subset C © G such that H = G \ C is a normal 
subgroup of G that contains Z(G) and the subset {x?| x € G}, and 


f(@) =aa(x) + fox), g(x) =a(x) forallx EG, 


where a is a C-odd function, a | is a homomorphism from H into F*, f2 is 
a C-even function such that f2(x) = a(x) f(x) for all x € H, where f is a 
homomorphism from H into the additive group of F, and f2(xy) = f2(x)a(y) 
for allx € C and y € H, where a is an arbitrary element of F. 


In (C), replace f by sf and take g(x) = 1 to get 


(3.62) fO)fO)=faty)+ fa—y)gt) forx,y €G, 


where f, g : G > F, G is a uniquely 2-divisible Abelian group, and F is a field of 
characteristic different from two. 

A pair (f, g) is an admissible solution of (3.62) if g(0) = 1 and f(O) = 2. 
Now, we characterize the Abelian groups with the property that, in every admissible 
solution (f, g), the second component g must be exponential. It is easy to see that 
f = Ei + Eo and g = E\E2, where EF), E2 : G — F satisfy (E) and (3.62). 
In Theorem 3.42, we will prove that they are the solutions of (3.62) under certain 
conditions. We begin with the following example, where g is not an exponential 
function. 
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Example. If S| > 2, (3.62) admits an admissible solution (f, g), where g is not 
an exponential function. 
Define f, g : G > F by 


0, forx ¢2G, —1, forx ¢2G. 


2, forx €2G 1, forx € 2G 
re=| =| 


It is easy to see that (f, g) is an admissible solution of (3.62) since S| > 2 and 
there exist y,z ¢ 2G such that y+z ¢ 2G. Then g(y + z) = —1, whereas 
g(y)g(z) = (—-1)(-1) = 1. So, g is not exponential. 


We now proceed to show that if S| < 2, then every admissible (f, g) has g 
exponential. We consider only f 4 0. 

First, for all y € G, g(y)g(—y) = 1 and g(y) £0 for all y. Set x = 0 in (3.62) 
to get (using f (0) = 2) 


fy) = fCy)go) = fO)g(y)g(y) forall y € G. 


So, whenever f(y) 4 0, g(y)g(—y) = 1. Suppose f(yo) = 0. Then f(—yo) = 0 
from f(—y) = f(y)g(—y). 


f(x + yo) + f(% — yo)g(yo) = 0. 


This shows g(yo) 4 0 or otherwise f(x + yo) = 0; that is, f = O and f(2yo) + 
2g (yo) = 0 (let x = yo). Similarly, (3.62) gives f(x — yo) + f(x + yo)g(—yo) = 9; 
that is, 2+ f(2yo)g(—yo) = 0. So, 


2+ g(—yo)(—2g(yo)) =0 or g(yo)g(—yo) = I. 


Next, define h : G x G > F by 


A(x, y) = g(x + y)g(x)'g(y)'| = ge + y)e(—x)g(-y) forx,y €G. 


It is obvious that g is exponential if and only if h(x, y) = 1. It is easy to check that 
h is symmetric and satisfies [212] 


@ fat+yfat+ty+z2nrgo,g=fart+yfat+y+z) forx,y,z€G, 


h(-y,y) = 1 = h(O,y), and the cocycle equation h(x, y)A(x + y,z) = 
h(x, y + z)hQy, Zz). 

If f never takes the value 0, then h = 1 and so g is exponential. Assume 
h(y,z) # | for some y, z € G. Claim y,z ¢ G. 

By (i), f@ +y)f@+y+2z) = 0 for all x € G. With x = —y, we see that 
2 f(z) = 0; that is, f(z) = 0. Similarly, f(y) = 0. By (3.60), 


fa@+tyt+tafarty) = f@x+2y+z)+ f@eat+y). 
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Hence f (2x +2y+z) =O forall x € G. If z € 2G, choose x € G with2x +z = 0. 
Then f (2y) = 0. Equation (3.60) yields (with x = y) f(y)? — f(2y) = 2g(y) = 0, 
which cannot be. Similarly, y ¢ 2G. 

Before proving the main result (Theorem 3.42), we show that when S| = 2, 
g is exponential. From the argument above, it follows that if y or z is in 2G, then 
h(y,z) = 1. Suppose y, z ¢ 2G. Then, since S| <2,y+z € 2G. In the cocycle 
equation, take x = —y to obtain 


h(-y, y)h(0,z) = 1-l=h(-y,y+z)hQy,z) =1-hG,z). 


Thus h(y, z) = | in all cases and g is exponential. 


Theorem 3.42. [212]. If (f, g) is an admissible solution of (3.62), then there is an 
extension field K of F with [K : F] < 2 and exponential functions E,, Ey: G > F 
such that f = E, + Eo, g = E\ Ed. 


Proof. In [424, 477] the solution of d’Alembert’s equation (C) was framed for the 
case F = C; however, examination of the proof shows that the following is true. 

If f : G > F isa solution of (C) with f(0) = 1, then there is an extension 
field K of F with [K : F] < 2 and an exponential function E : G — K such that 
2 f(x) = E(x) + E(—x) forall x € G. 

By assumption, G = 2G, so g is an exponential function. Define a : G > F by 
a(x) = 5 f (2x)g(—x). Then a satisfies d’ Alembert’s equation (C) and a(0) = 1. 
Hence there is a field K with F C K, [K : F] < 2, and an exponential function 
E:G—>Lwith 


f (2x) = E(x)g(x) + E(—x)g(x). 


Now let x € G be arbitrary. Since G is uniquely 2-divisible, there is a unique 
element 5 € G. Define EZ} = E(5)g(5), Ex = E(—4)g(%). Then Ey, E2 are 
exponential functions with E;(0) = E2(0) = 1. Also Ej + E2 = f(2-5) = f(x) 
and E\(x)E2(x) = g(5)g(5) = g(x). This completes the proof of the theorem. 


Now we turn our attention to sine equations. 


Sine Equations 


This section is devoted to sine functions. We will treat several functional equations 
connected to sine functions. The next section is devoted to both cosine and sine 
functions. First we start off with the sine functional equation 


(S) fxtyfa@-y=fe-fO)Y forx,y ER. 


Theorem 3.43. (Aczél [12], Aczél and Dhombres [44], Kurepa [581]). Suppose 
f : R= Ris a continuous (integrable, measurable) solution of (S) for x,y € R. 
Then f (x) = cx orcsinbx orcsinhbx, for x € R, b,c, are real constants. 
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Proof. Since f is a continuous solution of (S), it can be proved that it is differentiable 
and has derivatives of all orders. As before, we need derivatives of order two only. 

f = 0 isa solution of (S). Let us assume that f 4 0.x = 0 = y in (S) yields 
f(0) =0. 


Interchange x and y in (S) to have 


fa +y)fa-y=-fatyfO—x) 
or 
WFO) =-fWFC»r); 
that is, f is odd since f # 0. Differentiate (S) first with respect to y and then with 


respect to x to get 


f"atyfe—-y=fat+yf"@—-y) 
or 
f" fo) = fu) f"(). 


Since f 4 a0, we have f”(u) = kf (wu), a differential equation. As in Theorem 3.15, 
we get f(x) = cx or cy cosbx +d sinbx or c; coshbx + asinhbx, b,c,c1,d € R. 
f (0) = 0 yields c, = 0. This proves the theorem. 


Remark 3.43a. (Kannappan [429], Kurepa [581]). The only continuous solutions 
f :R— Cof(S) are f(x) = cx orcsinbx, b,c € C. 


Now we will determine the general solution of (S). 


Theorem 3.44. (See also Kannappan [477]). Let G be a 2-divisible group. If f : 
G — Cand acomplex-valued function on an arbitrary additive group G is a solution 
of (S) satisfying (K), then f is either additive, satisfies (A), or 


(3.37a) f(x) =k(g(®)-— g(-—x)), kEC, x EG, 


where g : G > C* is a homomorphism. Furthermore, every measurable solution of 
(S) on a locally compact Abelian group is continuous. 


Proof. Note that f = 0 is a solution of (S). Let us assume that f # 0. First 
x =0=yin(S) yields f(0) = 0. 

Now we will prove that f is odd; that is, f(—x) = —f(x) for all x € G. 
Changing y to —y in (S) yields f(y)? = f(—y)*; that is, f(—y) = +f(y) and 
x = 0in (S) gives f(y) f(-y) = —f OY. 

If f(y) = 0, then f(—y) = 0. If f(y) # 0, then f(—y) = — f(y); that is, f 
is odd. Or if f(—y) = f(y), then f(y) f (-y) = FO) = —FOY or FO) =0 = 
f (—y). Otherwise f(—y) = —f(y), so f is odd. 

Also f(y)(fQ)+ f(-y)) = Oand f(—y)(f(y) + f(-y)) = 0. Again (f(y) + 
f(—y))* = 0, yielding f odd. 
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(S) can be rewritten as (use 2-divisibility of G) 


u+ov - n= oy? 
(S1) fuse) = F( 5) ) -#/ 5) ) foru,v €G. 


Since f 4 0, there is ana such that f(a) 4 0. Define h : G > C by 


1 
(3.63) h(x) = af@ lt & t+a)—f(x-a)] forx EG. 


By the repeated use of (S1), (K), and (3.63), we get 


2 are: 
iret y+ for=yyif@ = ¢(S**) — (AR *) 


2 2 
- (=) + (=) 
= fa)fy +a) — fa) —a) 
=2f(@h(y)f@) 


or 


(3.27) faty)+ fe —y)=2f@)hy) forx,y €G, 


which is (3.27). From Theorem 3.41, we see that since f satisfies (K) and so does h, 
h satisfies (C). 

If h = 1, then f satisfies (J) and there is an additive A : G — C such that 
f(x) = A(x) +b for x € G. Since f (0) = 0, b = O and f is additive. 

Let h € 1. From Theorem 3.41 (note that h = 0 implies f = 0), it follows that 
there is ahomomorphism g : G > C% satisfying g(x + y) = g(x)g(y) forx, y EG 
such that 


(3.64) h(x) = sore 


Interchange x and y in (3.27) and add the resultant to (3.27) using f odd and (K) to 
get 


(3.65) fa +y)=fR)AY) +h) FY) forx,y eG. 
Using associativity a couple of times and (3.64), we obtain, for x, y, z € G, 
FAY HZ) = FH + Yh] +h& + Wf) 


=[f@AYy) + fO)AG)IA(Z) +h + yf) 
= f@Mh(y +z) FAG) O)A) + f(A): 


forx €G. 
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that is, 
[hx + y) —h@)AO) F(Z) = [AQ +z) —AO)A)ISO), 


meaning 
1 1 
Foor + g*(x)g*(y)) - qs) + g*(x))(g(y) + “| FQ) 


1 1 
= {Steet + g*(y)g*(z)] - gis) + g*(y)(g(z) + *@)| f(x) 
or 


(g(x) — 8° («))(g0) — 8 O)NF] = (80) — 8*))(g@) — 8*@)F(@). 


If g(x) = g*(x) for every x, then g(x)? = 1. Suppose there is an xo such that 
g(xo) = —1. Then -1 = g(xo) = g (2+) =8 (22)? = 1, a contradiction. 
So g(x) = | for every x, and then h(x) = 1, which is not the case, and f 4 0. Thus 


Fe) =Ke@)—g*@)l, forx eG. 


From the measurability of f follows that of / in (3.63) and then that of g in (3.64). 
But then g is continuous (being a measurable homomorphism; see Hewitt and Ross 
[369]). So, f is continuous. Thus the proof of this theorem is complete. 


We will study (3.65) in a later section. 


Result 3.44a. Suppose f : C — C is a nonzero entire solution of the functional 
equation (S). Then f is of the form 


f(z) =cz or f(z)=csinbz, forz€C, whereb,c €C; 


cf. Remark 3.43a. The result follows easily from Theorem 3.44. Also see H. Haruki 
[348]. 


As a special case of (S) on C, consider 


(S2) Ife +iy) fe —iy)| =If@)?- fO)1 forx,y ER. 


What are its solutions? Are they the same as the solutions above? The answer is the 
following. 


Result 3.44b. (Smajdor and Smajdor [754]). The only entire solutions of order 
D; p <4 of the equation (S2) are the same as the solutions above. 


As in Lemma 3.16, we can show that g is unique in (3.37a) when G is a cyclic 
group and prove the following. 


Theorem 3.44c. (Kurepa [579]). Let G be a cyclic group. Let f be a complex- 
valued function on G satisfying (S) with the properties that (1) f is not identically 
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zero and (2) is of the form (3.37a). Then we assert that there is one and only one 
homomorphism g of G into K satisfying (S). 


Proof. Suppose that there are two homomorphisms g; : G ~ C (i = 1, 2) such that 


gi(x) — g(x) 


(3.37a) f(x) = 5 


for allx € G, 


8 (x) = gi(—x) = a 
Now (3.37a) gives ‘hat 
gi(x) + gi (x) = ga(x) + 93 (x) 


for all x € G, and equivalently 


gi(x) — g2(x) + gi (x) — g3(x) =0; 
hence 
gi(x) — ga(x) 


gi) — go(x) + Poon =‘0 


which is the same as 


[egi(x) — go(x) [gi (x) g2(x) + 1] = 0. 


From the above, we conclude that, for each x, either 
g2(x) = gi(x) or g2(x) = —g7 (a). 
Let G be generated by a. Then, we would have either 
g2(a) = g1@), 


which in turn implies that g2 is the same as gj, since G is a cyclic group and there is 
nothing to prove, or 
g2(a) = —g87 (a). 
Now 
g2(a*) = ga(a)” = (—gi(a)) = gs} (@’). 
But 
g2(a°) = —gy(a*) or gi(a’). 


In the former case, g7 (a*) = 0, which cannot be. So, 8h (a*) = gi (a’); that is, 


gi (a) = +¢1(a). 


If gi (a) = gi(a), then f = 0, which is not so. So, g} (a) = —g1(a); that is, go = g1. 
This proves the theorem. 
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Theorem 3.45. (Kurepa [581]). Let H be a real Hilbert space. If f : H > Risa 
nonzero continuous solution of (S), then there exists an x9 € H such that 


f(x) = (x, x0), or bsin(x,xo9), or bsinh (x, x0), 


where b is a real constant. 


Proof. For a givenx € H,x #0, andt € R, define f, : R— Rby f-(t) = f(tx). 
Clearly f; satisfies (S) on R and is continuous. So Theorem 3.43 implies 


fx(t)=a(x)t or a(x)sinb@)t, fort eR, 


where b(x) is real or purely imaginary. 
Let us first consider the case f; (t) = a(x)t for x € H. So, 


(3.66) f (tx) = f(t) = a(x)t = f(x) forallx eH. 


Let Ho = {x € H: f(x) = 0}. Claim Ap is a closed linear subspace. 

Suppose x € Ho. Equation (3.66) shows that tx € Hp fort € R. Letx, y € Ap. 
Then (S) implies f(x + y) = O or f(x — y) = 0. Suppose f(x + y) = 0; that 
is, x + y € Ho. Replacing x, y in (S) by x + y and x — y, respectively, to have 
fat+y) — f(x —y)* = f(x) — fy) = 0, shows that (x — y) € Hp. (Also 
tx,ty € Ho, t € R; that is, —y € Hp andso x — y € Apo.) If f(x — y) = 0, then 
x — y € Ho. Hence Hp is a linear subspace (the case f(x — y) = 0 is similar), and 
let H, be the orthogonal complement of Hp = {x € H: (x, y) =O forall y € Ao}. 

Assert that the dimension of Hj is one. Suppose ej, e2 are two orthonormal 


elements of Hi. Then f(e;) 4 0, f(e2) # 0. Let to = Fn. Then f(e,) + 


tof (e2) = f(e1) + f (toe2) = 0 or f (e1)* = f (toe2)?. Now, with x = e1, y = t9e2, 
(S) yields f (e1 + toe2) f (e1 — toe2) = 0; that is, e; + t1e2 € Ho, where 16 = 17. But 
e1 +t1e2 € Ho implies (e; + te2, e1) = 0, which cannot be. So, dim H; = 1. 

The continuity of f implies that Ho is a closed linear subspace, so that every 
x € H can be written as x = (x, e)e + ho, where ho € Ho and H; = span{e}, with 
lle|| =1. 

Set y = (x, e)e in (S) to have f(x)* = f((x, e)e)’; that is, 


f(x) = £f (x, ee) = £(x, e) fe) = ex) ga), 


whose e(x) = +1, g(x) = (x, f(e)e). Obviously, g(x) satisfies (S). With f(x) = 
e(x)g(x) in (S), we get 


e(x + y)e(x — y)gix + y)g@e — y) = g(x + y)g(®— y). 


This shows that e(x + y)e(x — y) = 1; that is, e(x) = e(y) for all x, y € H. Thus 
f(x) = (x, x0) with x9 = f(e)e or x9 = —f (ede. 

The case f(tx) = a(x) sinb(x)t for all x € H,t € Ran be argued similarly 
to produce f(x) = asinh (x, xg) with a suitable x9 € H (see [409]). See [409] also 
to prove that a “mixed solution” cannot occur; that is, either f (tx) = a(x)t for all 
xé€H,teRor f(tx) =axsinb(x)t forallx € H, t€R. 
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Corollary 3.45a. Let M,;, = set of all matrices of order n over R endowed with the 
usual topology of matrices and f (4 0): Mn — R be a continuous solution of (S) 
with f(s~!xs) = f(x) for x € My, ands € M, orthogonal. Then f has one of 
the forms following f(x) =atrx, asinb(trx), ora sinh b(trx), where a, b are real 
and tt x = trace of the matrix x. 


Proof. If x = (x;;) and y = ();;) are two matrices from M,, and if we set 


n 
9) = Do ayy. 


i,j=l 


then M,, becomes the Hilbert (Euclidean) real space. Since the functional f satisfies 
all conditions of Theorem 3.45, we find that f has one of the forms 


f(x)=A(x), f(x) =bsinA(x), f(x) =bsinh A(x), 


where A(x) = (x,xo) and xg is some constant vector matrix from M,. For every 
real number ¢ and for every orthogonal matrix s, we have f (s~!txs) = f (tx), which 
implies A(s~!xs) = A(x). Indeed, this is obvious in the first case. In other cases, 
we have 


sintA(s~!xs) = sintA(x), sinh tA(s~!xs) = sinht A(x). 


If we take the derivative of these equations with respect to f and then set f = 0, 
we get the desired result. Thus the functional A(x) is orthogonally invariant. This 
continuity of A(x) and A(x + y) = A(x) + A(y) imply A(x) = atrx with some 
constant a ({[575, Theorem 7]). Q.E.D. 


3.4.5 More on Sine Functions on a Vector Space 


Theorem 3.46. (Baker [87]). Let X be a real vector space, and suppose f : x ->C 
is a nonzero solution of (S). Then f is continuous along rays if and only if either 
(i) f is linear (that is, additive and real homogeneous) or (ii) there exists a linear 
map A: X — C anda complex constant c such that 


f(x) =csin A(x). 


Proof. By Theorem 3.44, if f is additive, then we have (i) since, for additive func- 
tions, continuity along rays is equivalent to real homogeneity. 

Now suppose there exists F : X — Candk € C such that F(x+y) = F(x) F(y) 
and 


f(x) = k{ F(x) — F(-x)} 


for all x € X. We will show that F is continuous along rays. Clearly we may assume 


f #9. 


3.4 Solution of (CE) on a Non-Abelian Group 159 


If f(a) 4 0, then by [87] we have 


f(r + Ia) — f(r — Ya) 4 f(ra) 


ea) = 2fa 2k 


for all real r. Since f is continuous along rays, the mapping r > F (ra) is continu- 
ous. If f(a) = 0, choose b € X such that f(b) ~ 0. Then 


f(a+b)f(a—b) = f(a’ — fer =—-fby £0 


and hence f(a — b) 4 0. Thus the mappings r — F (rb) andr — F(r(a — b)) are 
continuous. But F(ra) = F(r(a — b)) F(rb) for all real r, and hence r > F (ra) is 
a continuous mapping. 

Thus F is continuous along rays and so there exists a linear function A : X > C 
and a complex constant k such that 


f(x) = kfexp {A(x)} — exp {—A(x)} 


for all x € X. Thus (ii) follows by letting A by — iA and c = 2ik. The converse is 
obvious. 


Corollary 3.46a. [87]. Suppose f : R” — C satisfies (S) for all x, y, € R". If there 
exists a measurable subset K of R" of positive measure such that the restriction of f 
to K is measurable, then f is continuous and hence either 


n 
(i) there exists {aj}"_, G C such that f(x) = > ajx; for all x = (x;) € R" or 
i=1 


(ii) there exist c € C and {a;}"_, C C such that 


i=l = 
n 
fQ@)= csin( 5 a) forx €R". 
i=1 


Note that the continuity assumption is weakened to measurability, and refer to 
Theorem 3.45. 


There are other functional equations, namely 
y 2 
(3.67) faety) + f@ —y) =2f() (1 97 (3) ) forx,y ER, 
2 
fx-y)=f@) (1 —2F (5) ) 


2 
~ £0) (1-2/ (5) ) forx,y él 
3.68) fF @+Y)+F/@-WI=S@)S@y) ——forx,y ER, 


which also characterize the sine function [663]. We now investigate the relationship 
between them and (S). We begin with (3.67) and (3.67a) and prove the following 
theorem. 


(3.67a) 


A 
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Theorem 3.47. (See also Parnami and Vasudeva [663]). A nonzero function f : 
R => C satisfies (3.67) if and only if f satisfies (3.67a). Furthermore, f satisfies (S) 


and 
y\? x\2 
(3.67) f(x +y) = F(x) [ -2f (5) + £0) [ 25 (2) 
and f is given by 
f(x) =c(E(Qx)— E(-x)) forx ER, 


where E is exponential (E). If f is continuous, then f(x) = csinbx, b,c €C. 


Proof. f = 0 is a solution of (S), (3.67), and (3.67a). We consider only the nonzero 
solution f. 

Suppose f is a solution of (3.67). Set y = 0 in (3.67) to get f (0) = 0. 

Now put x = 0 to have f(y) + f(—y) = 0; that is, f is odd. 

Interchange x and y in (3.67) to obtain 


x\2 
(3.69) f(x +y)— fey) = 2F0) [1-27 (5) iF xy eR. 


Adding (3.67) and (3.69), we have (3.67b). Subtracting (3.69) from (3.67), we obtain 
(3.67a). 

Conversely, let f be a solution (nonzero) of (3.67a). With y = x, (3.67a) gives 
f ) = 0. Now x = 0 in (3.67a) shows that f is odd. Changing y to —y in (3.67a), 
we get (3.67b). Add (3.67a) and (3.67b) to have (3.67). 

Let f be a solution of (3.67) ((3.67a)). Letting y = x in (3.67), we have f (2x) = 
2f(x) (1 —2F (§)’), xeR. 

From this and (3.67a) and (3.67b), we get 


f&+y)P— f@—y)? =4f@) 1 =f (5) | fy) [ —2f (5) | 


= f(2x)f@y) forx,y ER, 


which is (S). 
From Theorem 3.44, it follows that 


fx) =A) or f(x) =c(E@)—E(-x)), x eR, 


where A is additive and EF is exponential. 

But f(x) = A(x) in (3.67) gives A(x)A(y)* = 0; that is, A = 0 and so f = 0, 
which cannot be. So, f(x) = c(E(x) — E(—x)). If f is continuous, then f(x) = 
csinbx, c,b € C. This proves the theorem. 


Remark 3.47a. [663]. This theorem shows that the equation (3.67) is preferable to 
the equation (S) for the characterization of the sine function. Obviously (S) does not 
imply (3.67) when f # 0. 
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Let us turn our attention to (S) and (3.68). It turns out that H = {x € R: f(x) = 
0} plays a vital role in this investigation. 


Theorem 3.48. Suppose f : R — C is a solution of (S). Then f satisfies (3.68) 
and H is a subgroup of R. 


Proof. From (S), first replacing x by 5 + y, y by 5, and then x by 5 and y by 5 —y, 
we have 


A 


rofety=s (Sty) —F (3) forxyel 
fofe-n=F(Z) -F(G-y) forxyel 


a 


Adding the two equations above and using (S), we have (3.68). It is easy to verify 
that H is a subgroup. 


Remark 3.48a. [663]. The converse is not true; that is, that (3.68) does not imply 
(S) can be seen by the following example: 


x, forx €Q, 
Ff) = é for x irrational. 
f is continuous at 0 and does not satisfy (S), but f is a solution of (3.68). 
Indeed, for x € Q, y € irrational, f (x+y) f (x—y) = 0, but f(x)*— f(y)? = x?. 
So, (S) is not satisfied. 
As for (3.68), consider the following cases: 


(i) If y € irrational, then both sides of (3.68) are zeros. 
(ii) Let y € Q for x € irrational, and f(y)(f@+y)+fa-—y)) =0= f@)fy). 
(ii)For x € Q, fY(fa+y)+ f@—y) =y- 2x = f(x) f(y). 


In all cases, (3.68) holds. 


Now we show that H = {x € R: f(x) = 0} plays a crucial role in proving the 
converse. 


Theorem 3.48b. (See also [663]). Suppose f : R > C satisfies (3.68) and H is a 
subgroup of 8. Then f is a solution of (S). 


Proof. Recall that f is odd. Note that whenever x, y € H, then 2x,x +y € H. 
Several cases are to be considered. 


Case (i). Let x, y € H. Then f(x + y) f(x — y) =0= f(x)? — f(y). 


Case (ii). Let x € H, y ¢ H. Then f(x) = 0 = f(2x), f(y) # 0. Equation (3.68) 
yields f(y)(f(« + y) + f@ — y)) = 0; that is, f( + y) + Ff — y) = O and 
fiery) f@=)==f6+9yY = fea). 

Now x + y ¢ H; otherwise y € H. So, > ¢ H. Similarly, (x — y) ¢ H, and 
<> ¢ H. Further, + ¢ H, and either 5 € Hor} ZH. 
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First, let | ¢ H. Now (3.68) gives 


DUG) G-Dl-/@ 0-0 


(oP) 


Now let 5 ¢ H. Now (3.68) gives 
BUGS) +1 Dos ror-0 


£()- 62) -0 (use f is odd). 


From (3.68), we get 


ie a i) 


= (2) re-». 


that is, 


that is, 


and since f (5) is either = f (452) or = —f (45), we have, since f(x) = 0, 


either f(y) = f(x — y) or f(y) = —f@ — y). 
In either case, 
f@ty)f@—-y) =-f@-y)? = f@) - £0). 
Thus, (S) holds. 


Case (iii). Let x, y ¢ H. Then 5, s € H. Either — € H or = ¢ H. Suppose 
<> © H. Then (x + y) € H and +> ¢ H (otherwise, x, y ¢ H). Now (3.68) 


i 2 
gives 


AP) aH) 


=1 (2) re-»=0; 


that is, f(x) + f(y) = Oand f(x + y)f(@ —y) = 0 = f(x)? — FO). So, (S) 
holds. 
Suppose > ¢ H. Then either = € H or 4% ¢ H. 


First, let =" € H. Then (x — y) € H and 


APG G-) 


=1 (2) re+y=0; 
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that is, f(x) — f(y) = 0 (use f odd) and 
fe +y)f@—y) =0= fay - fOY. 


So, (S) holds. 
Finally, let =* ¢ H. Then 


Tos Wemece cee) 


= (=) fe+y) 


and 


Multiplying the two equations above and noting that f (*3*) 4 0, f (5) 4 0, 
and f is odd, we get (S). 
This proves the theorem. 


As a prelude to finding conditions for solutions of (3.68) to be solutions of (S) 
also, we have the following proposition. 


Proposition 3.48c. [663]. Let f : R — C be a solution of (3.68) but not of (S). 
Then H is a dense subset of R. 


Now, the following result is self-evident. 


Result 3.48d. [663]. Let f be a nonzero continuous mapping from R into C. Then 
every solution of (3.68) is a solution of (S) and vice versa. Moreover, f is of the 
form cx, csinbx, forb,c € C. 


From Theorem 3.47, it follows that every solution of (3.67) is a solution of (S). 
But the converse does not hold. Whereas a nonzero additive A is a solution of (S), 
A is not a solution of (3.67). The following theorem gives conditions for a solution 
of (S) also to be a solution of (3.67). 


Theorem 3.49. (See also [663]). Suppose f : R > C satisfies (S). Let H = {x € 
R: f(x) = 0}. [fH 4 2H, then there is anonzeroc € C such that F = cf satisfies 
(3.67). 

Proof. By hypothesis, there isan a ¢ H — 2H. So, $ ¢ H; otherwise a € 2H. Let 
c=5 7 and define F = cf. Clearly, F is a solution of (S), F(a) = 0, F ($) = 1. 
Use (S) and (S1) to show that F is a solution of (3.67). 
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Consider 


F@+y)+F@—y)= F@ +y)F (5) + F@-y)F (5) 
= x+y a a x+y a 2 
- ( 2 7) = 2 7) 
2 2 
x—-y @4 Xx-y @ 
+F(*77 49) -F( 2 -4) 


= FQ)F (y+ 5) + FOF (-y+ 5) 


= rool r(o+8)-" 6-9) 


Now y = $ yields F (x + §) + F( — £) = F(x)F(a) = 0, so that 
F (x + $) =—F (x — $) forall x € Rand 


3.70) Fa+ty)+ F(x —y)=2F(Q)F (y ra =) forx,y ER. 


eer rey 
a eo 
=F (8) 603) 

om =F (48) 7) F649) 


Thus, (3.70) and (3.70a) show that F is a solution of (3.67). Q.E.D. 


Now we treat the functional equation 


(3.71) gx—y)=s@)sO)+f@)f(y) forx,yeR, 
where f and g are complex-valued functions defined on reals (cf. (3.5)) that are 
related to (3.67). 

If g = 0in (3.71), then f = 0. If f = 0, then g = 0 or 1. We consider only 
nonconstant solutions of (3.71). 


Theorem 3.50. (See also Vaughan [814], Kurepa [580, 579], Aczél [12], Aczél and 
Dhombres [44], Parnami and Vasudeva [663], Klee [539]). Suppose f,g : R> C 
are nonconstant solutions of (3.71). Then f satisfies (3.67). Further, g(x) = 1 — 


2f (3)? and f and g also satisfy 
(3.27) f@t+ty)+ f@—y) =2f(@)g0), 


(3,72) gixt+y)=8@)e0) -f@)FO), 
(3.73) faty=f@)gsO)+fO)g@) forx,yeR, 
(3.73a) f@-—y)=f@)go)-g@)f) forx,y ER, 
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f is a solution of (3.69), and g satisfies (C). Moreover, f and g are given by 


(3.37a) f(x) =k(E(x)— E*(x)) forx ER, 
(3.37b) g(x) = Frew forx ER, 


where E : IR — C* is a homomorphism satisfying (E), k? = t 
If one of f, g is continuous, then 


g(x)=cosbx, f(x)=dsinbx, b,céEC, 
with d? = 1. 


Proof. Refer also to Theorems 3.17 and 3.41. 
Interchange x and y in (3.71) to get g(x — y) = g(y — x); that is, g is even. 
Replace y by —y in (3.71) to have 


(3.74) gsx+y)=s@)so)+f@)fCy) forx,yeR. 
Change x to —x and y to —y in (3.71), and use (3.71) to have 
f@FO) = f(x) f(-y) 
or, since f #0, 
f(x) =kf(-—x) =k f(x) and R=], 


When k = 1, f is even. Now (3.74) and (3.71) show that g is constant, which is not 
the case. So, f is odd. Now (3.74) becomes (3.72). 

Adding (3.71) and (3.72), we get that g is cosine (C). y = 0 in (3.71) yields 
g(0) = I since f is odd. 

Now, y = x in (3.71) gives f(x)* + g(x)? = 1, and (3.72) gives 


g(2x) = g(x)? — f(x) =1-2f (x)? 


or 


x\2 
e(x)=1=2F (5) forx € R. 
Replace y by y + z in (3.71), and use (3.71) and (3.72) and associativity to have 


ga —y-z=sh)got+z2+f@)fOtz), forx,y,zeR 
= sa)[gO)a) — fMFMO]+FMFO +2) 
= g(x — y)g(z) + f@ — y) fF @) 
= [g@)g) + f@)FO)I8@) + f& — VF @) 
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or 


fOMLFY +2) - fO)g@) = FLAG — y) + g@)FO)I: 


that is, 


fQ +2) — fO)8@) = f@MhAGY) 


for some h (obtained by specializing x with f(x) 4 0). 
Now, interchange z and y to have (use f 4 0) 


fg) + fOAY) = feed) + fO)AG) 


or 
fO)AG® — g@) = FEA) — gQ)) 
or 
h(y)=g(y)+cf(y), fory eR, 
and 


fO+2) = fO)a@) + fe) + cf()). 


Change y to —y and z to —z, and use g even and f odd to get 


-—ft+z2)=—-fO)es@ — f@aed) +cf O)F(), 


which by addition yields c = 0, and so (3.73), (3.73a), (3.27), (3.67), and (3.69) 
hold. 

Since g satisfies (C) by Theorem 3.21, we get (3.37b) for some homomorphism 
E:R— C*. Using this g in (3.71), we have 


E(x—y)+E*—y) _ EQ)+E*@) EQ)+E*W) 


5 5 5 + f@)FO); 


that is, 
E(x) — E* E(y) — E* 
pop) = EOE) £O)= £0), 
which yields (3.37a). 


It is now easy to obtain the continuous solution. This proves the theorem. 


Now we treat a sort of converse. 


Theorem 3.51a. [663]. Suppose a nonzero f : R > R satisfies (3.69) or (3.67). Let 
g(x) =1-2f ae Then g and f satisfy (3.71). 


Proof. We will first treat the case where f satisfies (3.69). 

Set y = 0 in (3.69) to have f(0) (1—2f (§)) = 0. 1f1 —2f (§)° = 0, then 
from (3.69) it follows that f = constant, which is not the case. So, f (0) = 0. 

Then x = 0 in (3.69) shows that f is odd, and g(x) = 1 —2f es gives that g 
is even and g(0) = 1. 
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In (3.69), first replace x by x + z and then replace y by y + z to have 


f@+yt+z)—f@—y+z) =2fO)a+2), 
f@+yt+z)—f@—-—y—-—z) =2f04+2z)a(), 


which by subtraction yields 


F@-—yt+z)—f@—y -—-2)=2f04+z)e) —2fO)e@ +z) 
=2f(z)g(x—y) (using (3.69)). 


Thus, 


(3.75) fO)g@ +2) = g@)fot+z2)—-f@g@—y) forz,x,y ER. 


Putting x = 0 in (3.75), we have 


(3.73) fy +z) = fg) + fO)g(z), 


which is (3.73). Then we obtain (3.73a). From (3.75) and (3.73), we obtain 


fOVMg@ +z) — g(x)g(z)] = —f@Ig@ — y) — g&)g(y)]. 


Specializing in z, we get 
(3.7 1a) g(x —y) = g(x)a(y) + FO)K(), xy ER, 


for some function k. 
Interchanging x and y in (3.71a) yields f(y)k(x) = f(x)k(y) or k(x) = cf (x), 
and (3.71a) becomes 


(3.71b) a(x —y) = g(x)g(y) + cf (x) fF). 


First put y = x in (3.71b) and then y = —x in (3.71b) to get 1 = g(x)? + cf (x)? 
and g(2x) = g(x)? — ef (x)?. But g(2x) = 1—2f (x)? = (1 — ef @)) — ef @). 
So, c = | and (3.71b) becomes (3.71). 

Suppose f satisfies (3.67). From Theorem 3.47, it follows that f is a solution of 
(3.71). 

This proves the theorem. 


Theorem 3.51b. (See also [663].) Let g : R — R be a nonzero solution of (C). 
Then there is an f : R — R such that f and g satisfy (3.71). 


Proof. From Theorem 3.21, we see that 


_ E(x) + E*(x) 


g(x) 5 


forx €R, 


where EF is a homomorphism from R to R satisfying (E). 


168 3 Trigonometric Functional Equations 
Define f(x) = EG@)E"@) for x € R. Then 


_E@-y)+E*@—y) EQ)+E*%) EY) +E*O) 
gt —y)— g@)gQ) = ———— 


1 
= qlE@E*O) + E*(x)E(y) — E@)E() — E@)E*0)] 


_ E(@x)—E*(x*) EQ)-£E*Q) 
7 2 2 
= f)fQ). 


This completes the proof of this theorem. 


Now, we treat the functional equation connected with the sine functional equation 
(S) and prove the following theorem. 


Theorem 3.52. (See also Corovei [172].) Let f and g be nonzero real-valued func- 
tions of real variables satisfying the equation 


(3.76) fede -—x) =f (5)a(F)-F(e-F)a( —2), x,yeER. 


2 2 2 
Then g is a solution of (S), and f and g are given by 
(3.77) g(x) = AC), fe) =c+dA(x), 


(3.77b) g(x) = D(E(x) — E(-x)),  f@) = clE(@) — E(-x)] + dE(-x), 

for x €R, where A satisfies (A), E satisfies (E), and b,c, d are constants. 

Proof. Replace x by x + y and y by 2x in (3.76) to have 

(3.78) f(x t+ yg —y) = f@)g)— fO)s(y) forx,y eR. 

Interchanging x and y in (3.78), we have 
fat+y)sy-—x)=-f@+y)g@-—y) or fwgo)=—-fWes(-), 


and since f # 0, g is odd. 
First set y = — x in (3.78) to have 


(3.78a) fO)g 2x) = g(x)\F@) + f(-x)) forx ER. 
Now, change y to —y in (3.78) to have 
(3.78b) f@—ygat+y)=f@)sQ)+ f-y)sO), x yER. 


Subtract (3.78) from (3.78b) and use (3.78a) to have 
f(x —y)g@ty)— fa +y)e@-—y)=sO)F-y) + FO); 


that is, 


(3.79) fo)gtu) — fs) = fO)gu—v) foru,o eR. 
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Case I. Suppose f (0) = 0. 


Then f(v)g(u) = f (u)g(v) or f (uv) = cg(u) for some constant c since g 4 0. This 
f in (3.78) shows that g is a solution of (S). 


Case II. Suppose f (0) 4 0. 
From (3.78a), (3.78) and (3.78b), 


Ff) g(x + y)g(x — y) 


PYER oP) 


| 
= Fee (3) f (S)+F (-3)|+/@se@e (5) [ 


=—f(0)soy + fO)Psh)* 
thus g satisfies (S). 

Since g is a solution of (S), by Theorem 3.44, g(x) = A(x) or g(x) = 
b[E(x) — E(—x)] for x € R, where A satisfies (A), E satisfies (E), and b is a 


constant. 
When g(x) = A(x), (3.79) yields 


FOAM) — AO) = AWS) — A) FW 


— 
NIX wp 
— 
+ 
i 
| 
Nl< 


| | 
w]e wo] & 
‘eet amet 
EE 


or 
AWU)(F(e) — FO)) = AW&)(FW) — FO); 


that is, 
ft) =c+dAtu), foru eR, since A 40. 


Corresponding to g(x) = b(E(x) — E(—x)), (3.79) gives 
E(u) E(v) 
E(v) 7 E(u) 


f (0) = f@EW) — E(-u)] — fW[EQ) — E(-»)); 
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that is, 


fO)[E Qu) — E(2v)] = f)[EQu) — NE) — fW[E Qo) — NEW) 


or 
[E(2u) — I][f@)E) — fO)] = (Qo) — DIFWMEW — fO)] 
or 
SJE) — f(O) = c(EQv) — 1), 
meaning 


f(vo) =c[E(v) — E(-v)] + dE(-v) forv ER. 


This completes the proof of this theorem. 


Corollary 3.52a. [172]. Nonzero continuous solutions are given by 


g(x) = ax, f(x) =bx +c; 
f(x) =csinax+dcosax, g(x) =Dsinax; 
g(x) = bsinhax, f(x) =csinhax +dcoshax, xeéER. 


3.4.6 Sine Solution 


If one is interested in obtaining only the sine solution or in eliminating the linear 
solution cx and hyperbolic solution sinhcx, one can consider either (Van Vleck 


[813]) 

(3.80) fa-y+d)—fatytd)=2f@)fO), xy ER, 

or (Kannappan [428]) 

(3.81) h(@w+y+d)h(x—y+d)=h(xy—-hQy, x, yeR. 

First we treat (3.80) in Theorem 3.53 and then consider (3.81) in Theorem 3.53a. 
Theorem 3.53. (See also [813].) Suppose f (4 0): R > C satisfies (3.80), where 


d is a nonzero real constant. Then f is a periodic function with period 4d, g(x) = 
f ( +d) is a solution of (C), and f(x) = Se where E satisfies (E) with 
E(d)? = —1. Moreover, if f is a nonzero continuous solution of (3.80), then f is 
given by f(x) = (-1)" sin Cnt 2. for x eRneZ. 


Proof. Let y = 0 in (3.80) to have f(0) = 0. Replacing y with —y in (3.80) shows 
that f is odd (that is, f(—y) = —f(y)) fory € R. 

Put x = —d in (3.80) to get f(—d) = —1 = —f(d). Set y = d in (3.80) to 
have f(x) = —f(x + 2d) = f(x + 4d); that is, f has period 4d and f(2d) = 0. 
Replacing x by x + d and y by y +d in (3.80), we get 


fa@-ytadt+fatytdad=2fx+afy ta); 
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that is, p(x) = f(x +d) satisfies (C) and has period 4d. Since f (2d) = —f (0) = 0, 

g(d) = 0. From Theorem 3.21, we see that 

Eix)+2(=%) 
2 


where E is a solution of (E). Hence 


g(x) = with E(d)* = —1, 


_ EQ)E(4) + E(-x)EM) 


f(s) = pe =a) - 
_ EQ) — E(-x) 
= Ed) forx € R. 


Suppose f is continuous. Then g is continuous and g(x) = cos cx, for some constant 
c, with period 4d; that is, cos cx = cosc(x + 4d) and 4cd = 2na orc = aa Hence 


nn 
f(x) =cosc(x — d) = cos aq ™ —d) 


(= =) 
= COs = 
S\ od (2 


Since f (2d) = 0, cos a = 0 orn is odd. So, 


fOecn (= te _ Qa t “*) 


; [eo 
= sin |} —————_ — na 


2d 
(2n + 1)xx 
2d 


=cosnz sin ( 


This completes the proof of this theorem. 


Now we take up (3.81). 


Theorem 3.53a. [428]. Let h be a nonzero complex-valued function of a real vari- 
able satisfying (3.81) for x,y € IR, where d(¢ 0) is a real constant. Then h is 
periodic with period 2d, g(x) = h(x +d) is a solution of (S) with period 2d and 
only this, and h(x) = ray (E() — E(—x)), where E is a solution of (E) with 


nex 


E(d)* = 1. Further, if h is continuous, then h(x) = bcos (nz) sin “7 forx € R. 


Proof. Let x = 0, y = 0 in (3.81) to get h(d) = 0. Replace y by —y in (3.81) and 
use (3.81) to have h(y)* = h(—y)? for all y € R. So, h(—d) = 0. x = —d in (3.81) 
gives h(y)h(—y) = —A(y)?. 

So, if h(y) & 0, then h(—y) = —A(y). But if h(y) = 0, since h(y)? = h(-y)’, 
h(—y) = 0; that is, / is an odd function. 

Letting y = d in (3.81) and using f (d) = 0, we get 


h(x + 2d)h(x) = h(x)’. 
Hence h(x + 2d) = h(x) whenever h(x) 4 0. 
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Suppose h(xo) = 0. Replace x by xo and y by xo + 2d in (3.81) to get 
h(xo + 2d) = 0. Thus 


h(x + 2d) = h(x) forall x € R; 


that is, 4 is periodic with period 2d, h(2d) = h(0) = 0. 
Now change x to x + d and y to y + d in (3.81) to have 


h(x +y+d)h(x —y +d) =h(x +d)? —h(y +d)’; 


that is, g satisfies (S), where g(x) = h(x +d) and g has period 2d. This eliminates 
the additive solution, and from Theorem 3.44, we see that 
g(x) = b(E(x) — E(—x)), where E is a solution of (E). Since g(d) = h(2d) = 0, 
we have E(d) — E(—d) = 0 or E(d)? = 1. Then h(x) = g(x — d) = DESAY 
forx ER. 

As for the continuous solution, 


g(x) = bsincx = bsinc(x + 2d), 
so that cd = nz and 
h(x) = g(x —d) =bsin ~o —d) 


| nax 
= bcosna ae forx €R, ne Z. 


3.4.7 Characterization of the Sine 


As in the case of the cosine, the sine function can also be characterized with the help 
of a functional equation in a single variable. It is natural to expect to assume some 
additional conditions to achieve this. 


Theorem 3.54. (Dubikajtis [231] (cf. Theorem 3.37)). g(x) = sinx is the only 
solution satisfying the following conditions: p : R > R satisfies 


(i) g is a solution of the equation 
(3.82) 2e(x) +9 (> — 2x) =1 forxeR, 


(ii) @ is odd (i.e., p(—x) = —o(x)) forx ER, 
(iii) ¢ is positive for x €]0, FL, and 
(iv) @ is continuous in a neigbourhood of 0. 


Proof. Claim 


(v) @ is periodic with period 27. 
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Replace x by —x in (3.82) and use (11) to imply 


that is, 


Change x to x — $ to get 
p(x) =o —x), 
which by replacing x by x + z yields 


g(x) = —9(—x) = —9(a +x) = 92a +x). 
So, g has period 27. 
Some remarks are in order. 


Remark 1. Condition (ii) cannot be replaced by (v), as can be seen by the example 
1 
p(x) = 3: 


Remark 2. Condition (iii) is necessary, as can be seen by the example g(x) = 
(—1)" sin[(2n + 1)x], where n € Z +. But condition (iii) can be replaced by if 
0O<x< 5,0 < g(x) < Fe 


Remark 3. Condition (iv) can be replaced by the continuity of g in JO, €[ (use 
condition (ii)). 


To continue the proof, the sign of g(x) is uniquely determined by (1), (ii), (iii), 
and the periodicity of g. 

Indeed, (ii) shows (0) = 0 and (i) yields g (5) = 1. As x varies in ]0, $[, 
m= — xX varies in es ax[. Then g(x) = g(a — x) shows that g(x) > 0 in JO, z[ and 
(ii) implies g(x) < 0 in ] — z, O[. Now, the periodicity implies that g has a unique 
sign at every point on R. 

Claim now that g is determined uniquely at all points of the form x = aa for 
ke Zandne Z,. 


The proof is by induction on n. For n = 0, consider o (45) = g(2km) = 
(0) =0. 
ka 
=| 


Suppose the value of g (5) is determined for all k. From (3.82) there follows 


kn 1 21-2 _ kr 
lor) Ayo [Pl oer yp 


and the sign is determined uniquely. The proof of continuity of f on R from (iv) is 
similar to the proof in Theorem 3.38. Thus, there is only one f satisfying (i) to (iv), 
and g(x) = sin x satisfies all these conditions. 


174 3 Trigonometric Functional Equations 


3.4.8 General Trigonometric Functional Equations—The Addition and 
Subtraction Formulas 


From (3.1) and (3.5) arise the functional equations 


(3.73) fat+y)=f@)go)+ fO)E@), 
(3.73a) f@-y)= f@)sQ) — fO)s@&), 
(3.72) ga t+y)=s@)g)—-f@)fO), 
(3.71) ga —y)=8@)gO)+f@)fO), forx,yeG, 


where f, g : G > F, Gis a group, and F is a field. 

Note that f(x) = sinx and g(x) = cosx satisfy all these equations on R. 
These functional equations have been investigated together and individually by sev- 
eral authors (see, among others, [12, 33, 1, 40, 426, 162, 580, 488, 816, 308, 306, 
733, 729, 282, 156, 830, 831]). It was observed that (3.71) alone is enough to char- 
acterize both sine and cosine, which none of the remaining equations are able to 
do. 

Functional equations connected to hyperbolic functions include 


(3.72a) g(x +y)=g(x)g(y) + fa) fQ), 
(3.71a) g(x —y)=g)g(y)—f@)fQ), forx,y eR, 


where f, g : R > C. Obviously, hyperbolic functions g(x) = coshx, f(x) = sinhx 
are solutions of (3.73a) and (3.71a) and (3.73) and (3.72a). It can be shown [816, 
830, 831] that the general solutions of (3.71a) and (3.73a) and (3.72) and (3.72a), 
respectively, are given by 


_ Exe) + Ei(-x) 
= a a 


_ E(x) = Ex(-x) 


g(x) f() 5 , 


and 


_ — oe E\(x) ee, 


g(x) 
where E; (i = 1, 2) are exponential functions satisfying (E). 
First we will consider the functional equation 


(3.71) g(x —y)=e)g0)+f@)fQ) forx,y <¢G, 


where f, g : G > Cand G isa group. 
Proof. Suppose g = 0. Then f = 0. 

Suppose f = 0. Then g(x — y) = g(x)g(y) = g(y — x). That is, g is even and 
g(x + y) = g(x)g(y), so that g(x + y) = g( — y), meaning g(x) = constant, say 
c. So,c = c? orc = Oorc = 1. If f(x) = constant, then so is g. We will consider 
only nontrivial (nonconstant) solutions. 


3.4 Solution of (CE) on a Non-Abelian Group 


Interchanging of x and y in (3.71) yields 


a(x —y) = g(y —- 2); 


that is, g is even. Now we will show that f is odd. 
Replace y by —y in (3.71) to get 


(3.82) saxty) =s)so)+ f@)f(-y) forx,y eG. 
Change x to —x in (3.71) to have 

(3.82a) sx+y)=s@)sO)t+ f—x)f(y) forx,y eG. 
Thus 


FOF Hy) = fEHx) FO): 


that is, f(x) f(y) = f(—x) f(y) and f («= f(—x)?. Hence, f(-x) = 


forallx EG. 
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E f(x) 


If f (xo) = 0, then f(—xo0) = 0 = —f (xo). Suppose f (xo) 4 0 and f(—x0) = 
f (xo). Then f (x) f (xo) = f (—x) f (—xo), implying f(x) = f(—+x), for all x, since 
f 40. That is, f is even and (3.71) and (3.72) show that g is a constant, which is 


not the case. 
Hence f is odd and (3.82) becomes 


(3.72) saxty) =s)s)—f@)fO) forx,y eG, 
and g satisfies (C) (add (3.71) and (3.72)). 
Further, applying associativity and (3.72), we get 
gx +y+z)=[e@)gQ) — F@)FO)a@ — F@ + yf) 
= g(x)[g)s@) — fO)F@)] — F@)FO +2); 


that is, 
[faet+y)—fO)s@)f@ =(fO +2) -— fFO)E@F&) 
or 
(3.83) fat+y)— fO)g) = hy) f@), 
where 


1 


Oa 


[f+ zo) — fO)e@o)] 
with f (zo) £0. 


First, setting x = —y in (3.83), we have f(y)(A(y) — g(y)) = 0 for all y, and 
then interchange x and y in (3.83) to get f(x)(A(y) — g(Qv)) = fO)[AG) — g@)]. 
Then 0 = f(x)f()(A(y) — g(y)) = FO) AC) — g(x)); that is, AG) = g(x), 


since f #0. Hence 


(3.73) fa+y)= f)go)+ fg) forx,y €G. 
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Replace y by —y in (3.73) to obtain 


(3.73a) f(x —y) = f@)sy) -— fO)s@) forx,yeG. 
Suppose g satisfies (K). Then, by Theorem 3.21, 


(3.84) g(x) = fore forx € G, 


where E : G > C* is a homomorphism, satisfying (E). This value of g in (3.71) 
results in 


E(x)— E*(x) EQ)-E*Q) 


F@)FO) = => 5 


forx, y € G, 


and since f 4 0, 
(3.85) f(x) =k(E(x) — E*(x)) forx €G, 


with k? = i Thus we have proved the following theorem. 


Theorem 3.55. Suppose f, g : G > C satisfy (3.71) with g satisfying (K). Noncon- 
stant f, g satisfying (3.71) are given by (3.84) and (3.85). Further, f and g satisfy 
(3.71), (3.72), (3.73), and (3.73a). 


Corollary 3.55a. Let f,g : R — C be a nonconstant solution of (3.71) with g 
continuous. Then f is also continuous and g(x) = cos bx and f (x) = k sinbx with 
e= > where b is a complex constant. 


A Generalization 


Now we consider the functional equation (treated in Chung, Kannappan, and Ng 
[162]) 


(3.86) fa+y) = f@go)+e@)fO)+h@AQy) forx,y € G, 


where f, g, : G > Cand Gis a group, which is a generalization of many trigono- 
metric identities, for example (3.73), (3.72), and (3.72a). The familiar cos (x + y) 
and sin (x + y) admit such expansions as in (3.86). 


Result 3.56. [162]. Let f 4 0 and g,h : G > C be mappings. If they satisfy the 
equation (3.86), then they have one of the forms 


3 3 3 
(3.86a) f= > ae, s= > bee, h= > Gh, 
i=l i=l i=l 


with 
a\ bi Cl by bo b3 a{ 0 0 
a2 bo c2 a| a2 a3 = 0) a2 0) 3 
a3 b3 C3 ci C2 C3 0 0 a3 
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or 
f=QF\+@ME+a3FEA, 
(3.86b) g=bE, +hE+b3EA, 
h=cjE,+@mE+cEA, 


with 
ay bi cy b, bo b3 a, 0 O 
a2 bz c2 a, a2 a3} =} 0 aa3}; 
a3 b3 C3 Ci C2 C3 0) a3 0) 

or 


f=aqEA,;+aE+a3EA+ asE A’, 
(3.86c) g=b, EA, +E +b3EA + b4E A’, 
h=c,jEAj+@mE+c3EA+ c4E A’, 


with 
ay by cy by bo bx ba 0a 0 0 
az b2 c2 | a1 a2 a3 a4 
a3 b3 c3 pei ogc’ ban Ie a3 2a4 0 |’ 
a4 b4 c4 See a 0 a4 0 0 


where E, E; (i = 1,2,3) are exponentials satisfying (E), and A, A, satisfy (A). 
The converse holds. 


As special cases of (3.86), we can obtain the general solution of the functional 
equations (3.73) and (3.72a) on G. The equation (3.73) was solved by Abel [1] using 
differentiation. Hilbert’s fifth problem raised the question of solving Abel’s equation 
under weaker assumptions (Abel’s equations will be dealt with in Chapter 10) (see 
[33]). In [162], the general solution of (3.73) was determined. 


Corollary 3.56a. [162]. Let f (4 0) and g : G > C be mappings. If they satisfy 
the functional equation 


(3.73) f@+y) = f@)gs) + s@)s), 
then they are of the form 


f=QE,+aE+a3EA, 
g=bE,+boE+b3EA, 


with the constraint 
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Conversely, f and g given above with the constraints above satisfy (3.73) (cf. 
Remark 3.56b ). 


The proof follows from Result 3.56. 


Remark 3.56b. The general solutions of (3.73) are given by 


1 1 
(3.87) foqhi—ak, g- Fire fees. 


In Corollary 3.56a, the functional equation is of interest only when f # 0; 1.e., 
only the case (a1, a2, a3) # 0 is of interest. By the rank consideration in the con- 
straints, we get 

a; 0 O 
rank | O a2 a3 | <2. 
0 a3 0 

This implies az = 0 or ay = O. Having assumed (a1, a2, a3) 4 0, this implies 

b3 = 0 or bi = O, respectively. Thus f (¢ 0) and g are either in span{£ 1, E} or in 

span{E, EA}. 

(1) Suppose a3 = 0. Then b3 = 0. Since a; = 0 implies b} = 0 fori = 1,2, we 
may assume aj,a2 # 0 by adjusting F,, E. Then the form of f and g can be 
replaced by 

1 1 
f=aE,-aE and g= Fi + as 


(2) Suppose a3 # 0. Then a; = 0 = by. We may take a3 = | by adjusting A. With 
these, (3.81) can be replaced by 


f=EA and g=E. 


Corollary 3.56c. [162]. Let f, g : G - C be mappings. If they satisfy the functional 
equation 


(3.72a) gixty)=fa)fO)+e@)gQ) forx,y €G, 


then f(x + y) = f(x)givy) + ga) fO) + 2af (x) f() for some constant a, and f 
and g are of the form 


g=aqE,\+aE+aFEA, 
f=aE+aFE4+oEFA, 


with the constraint 


a} Cl a, 0 O 
a, a2 a3} _ 

a2 C2 =|0mMma], 
Cl C2 C3 


a3 C3 0 a3 0) 


where E, E, satisfy (E) and A satisfies (A). 
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Conversely, if f and h are given as above, where the coefficients satisfy the 
constraints above, then they form a solution of (3.72a), (cf. Remark 3.56a). 


Remark 3.56d. The general solutions of (3.72a) are given by 


f@)=c\(Ei—E), g@) =a Fi +U—-ay)ek; 


(3.88) f@) =+iFA, gx) =E+EFA. 


Consider the constraint in Corollary 3.52c. Since the rank of the matrices on the 
left is at most 2, we get 
a, 0 0 
rank | O a2 a3 | <2. 
0 a3 0 


This implies either a3 = 0 or aj = 0. 
(1) If a3 = 0, then cz = 0, and so f,h © span{E;, E}. When a; = 0, we must 


also have c; = 0. We may therefore assume aj, a2 4 0 by adjusting EF}, E if 
necessary. With this assumption, f and g can be replaced by 


f=aqE,;+U-a)E and g=cjE,;—-cyE 
with ay + cf = a1 (a; 4 0, 1 optional). 


(2) If a3 A 0, then aj = cj = 0. We can take a3 = | by adjusting A. With this, f 
and g can be replaced by 


f=E+EA and g=x+iEA. 


Result 3.57. (Aczél [12]). The nonzero continuous solutions f, g : R > Rof (3.72) 
are given by 


gax)= expcx, fQy= bexpcx, (b4#+1); 


1 1 
l=p 1—b? 
g(x) = (1+ bx) expcex, f (x) = bx exp cx; 

g(x) = expcx(cos bx +d sinbx), f(x) = V1+d? expcex sin bx; 


g(x) =expcex(coshbx + dsinhbx), f(x) = Vd? — lexpex sinh bx; 


and that of (3.73a) is given by 


g(x) =1+dx, f(x) = dx; 
g(x) =cosdx + bsindx, f(x) = bsindx; 
g(x) = coshdx + bsinhdx, f(x) = bsinhdx, 


where b,c, d are constants. 
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Many, including Steinitz and Neder (see [12, pp. 136]), treated the functional 
equation 


B89) ft») = fF (Ftr)+F(F+4) £0), forxyeR, 


under continuity of f : R — C. We determine the solutions of (3.89) using (3.73). 
Theorem 3.58. Suppose f(4 0): R— C satisfies (3.89). Then f is given by 


(3.89a) Ff) = 5E2(x-4) or Fe) =a(B@) - Exte)), 


with 2a = E| (—5), Ey (—3) Ex (% z)= =-l. 


Further, if f is (measurable) continuous, then f has the form 


(3.89b) f®)= x expe (« = =) or f(x) =a(expex — expdx), 


with 2a = exp(—c4), exp(c —a)F = —1. 
Proof. Define g(x) = f (5 + x) for x € R. Then (3.89) becomes (3.73). We have 
to determine the solution corresponding to (1) of (3.87) (note that a cannot be zero), 
1 1 1 
f(x) =a(Eve)- Ee), f ($+) = 5216) + 5220), 


where EF, E2 satisfy (E). Thus 


BiG) + 5Ba(s) = a[£100): (5) - Eo) (5) 


or 


(3.90) E\(x) E =aE| (5)|- = — E(x) E +aEp (5)]. 


Suppose 4 +aE2 (4 = = 0 in (3.90). Then either E; (x) = 0 or 5 —aE, ( 
Henge when E\(x) = 0, we have f(x) = —aE2(x) witha = -5 
which is (3.89a) (first part). If 5 —ak, (5) = 0, then 


a se (-5) and Ey (5) Om oe (5) =, 


f(x) = a(E\ (x) — E2(x)), which is the second part of (3.89a). If 5 +aE2 (5) #0, 
then F2(x) = cE,(x), and we get (3.89a) (first part). 
Corresponding to (2) of (3.87), we get 


f(x) = E(w)A(), 
f (F +x) = £@) 


=FG)E (5) (AG) +A (5)) 


$)=0. 
E2(-§). 
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that is, since E 4 0, A(x) +A (5) =E (—$), meaning A is a constant and so 
A = 0, which cannot be. This case is not possible. 
The continuous solution can be obtained either from (3.89) or from Remark 3.56b. 


3.4.9 Vibration of String Equation (VS) 


Finally, in this section, we treat the equation (VS), mentioned in the introduction, 
considered by d’ Alembert, namely 


(VS) fa +y)—f@—-—y)=AC)k(y) forx,y ER, 


where f,h,k : R — C (or R). (See also Sinopoulos [746].) 
First we consider the trivial solutions. If f = constant, then either h = 0, k 
arbitrary or / arbitrary, k = 0. 


(3.91) When k = constant, y = 0 givesh = 0, f = constant. 
Suppose h(x) = d. Then (VS) becomes 
(3.92) f(x+y)—f@—y)=dk(y), d #0. 


Set x = y in (3.92) to get f(2y) — f(O) = dk(y). 
Setting x + y=uandx — y = 0 in (3.92), we get 


= 


fu) — f(v) = dk (“ ) = f(u—v) — f) 


and 
fo) + f(-v) = 2f (0) 


(interchange u and v or put u = 0). Interchange x and y in (3.92) and add the 
resultant to (3.92) to obtain 


2f (x + y) — 2f(0) = dk(x) + dk(y); 
that is (from (PA), Result 1.56), 
f(x) = AQ) +b, dk(x) = 2A(x), 


where A is additive (it could have used that f (wu) — f (0) is additive). So, the solution 
is 


2 
(3.91a) f@)=AQ)+b, ha)=d, kaw)= qos d#0. 
The continuous solution is 


(3.91b) f@=cx+b, h(x)=d, kw)= “cx. 


182 3 Trigonometric Functional Equations 

We will now consider the nontrivial solutions of (VS). Put x = 0 in (VS) to have 
(3.93) f(y)— f(-y) =hO)k)  fory ER. 
We now consider the cases according to whether /(0) = 0 or h(O) 4 0. 


Case 1. Suppose /(0) = 0. Equation (3.93) shows that f is even. Interchange x and 
y in (VS) to get 


fa+y)—f@-y)=hA()k@), forx,y ER, 


or 
h(x)k(y) = hk); 


that is, 
k(x) =ch(x) forx ER. 


Note that c cannot be zero. Otherwise k = 0 and f is a constant. 
Then (VS) becomes 


(3.94) fi«aty)-f@-y)=ch(x)h(y) forx,yeER. 


Replace x by 5 and y by 5 to have 


f(x) = fO)+cn (Gy —b+ch Gy 


so that (3.94) becomes 
2 2 
h (- ~ *) —h (=) = h(x)h(y) 


(S) h(u + v)h(u — v) = h(u)? — h(v)* 


or 


(set x = u-+ov, y =u —v), which is the sine equation (S) and the general solution 
of (S) as given by Theorem 3.44, 


(3.91c) h(x) =A(x) or A(x) =d(E(x*) — E(—x)) forx €R, 
where E : R — C* is exponential and A is additive. Then 


(3.91d) k(x) =cA(x) or k(x) =cd(E(x)— E(—x)) forx €R, 


2 
GB9le) f()=b+ AGP or fx) =b +ed? (z (5)-« (=)) | 
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If f is measurable or continuous and f : R > C, then 


(3.91d)) h(x) = dx or dsinax, a,dé€c, 
(3.91d2) k(x) = cdx or cdsinax, 
(3.91d3) f@)=b+ 5x? or b+ 4ca? sin? — 


where a,b,c,d€C. 
If f,k, : R > R are continuous and satisfy (VS), then (Theorem 3.43) 


(3.91da) 
h(x) = dx or d sin ax or asinh(ax), 
k(x) = cdx or cd sinax or cd sinh(ax), 
da 
f@)=b+ x7 or b — 4ca? sin? > or b + cd? sinh? 
cd* 2 2 2 2 pre 
ee a or b — 2cd* + 2cd“cosax orb-+cd* sinh > 


where a,b,c,d ER. 


Case 2. Let h(0) 4 0. Then, from (3.93) we see that k is odd. Replacing x by - and 
y by 5 in (VS), we have 


f(x) = fO +h (5) k (5) exbach (5) « (5) 


2 
and 
(3.95) hn (E*) (2) 0 (2) (G2) = hone 
or 
(3.95a) h(u)k(u) —h(vo)k(ve) =h(u+ov)k(u—v) foru,o ER. 


Replace v by —p in (3.95a) and subtract (3.95a) from the resultant to obtain 


k(u+v)h(tu —v) —k(u— v)h(u + v) = k(v)(A(v) + A(—0d)) 
=h(O)k(2v) (let u = v, use k(0) = 0), 


which can be put in the form 
(3.96) kK(Qx)h(Qy) —kQ)hQx) =AO)K(&—y) forx,yEeR. 
Replace y by —y in (3.96) and add the resultant to have 
AO)K@+y)+k@—y) =kKa)AY) +h(-y)), 


which is the same as (3.27) (take 29(y) = mon (y) + h(—y))) (we do not need g). 
Using Theorem 3.41, we get k = 0, which cannot be, or k is Jensen, and since k is 
odd, 
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(3.91f) k(x) = A(x) or k(x) =c(E(x) — E(-x)), 


for x € R, where E satisfies (E) and A satisfies (A) (see (3.27)). 
Suppose k(x) = A(x) and (3.96) yields 


AQx)(A(y) — 1) = AQ)A@)— 1), c1 =A), 
and since A is not zero, we get 
(3.91f1) h(x) =c1 +c2A(x), 
and then from (3.95) we have 
(3.91f) f(x) =b+ SAG) ai FAQ), xeR. 


If f is measurable or continuous, 


d 
GB.91g)  k(x)=dx, h(x) =e + dx, f(x) = b+ x + dx, 


for x € R, where d, c,, c2, b are constants. 
Now we consider the case k(x) = c(E(x) — E(—x)). As before, substituting for 
k in (3.96), we obtain 


(E(x) — E(—x))h(y) — (EQ) — E(-y)h@) = c1(EQ@ — y) — E(-x + y)) 


or 
E(x) ~~) 
/ = A(x) = ~ 
(Ew) - EG — a(y) — (20 Eo — h(x) = a( E(x))° 


that is, 


(E(x)? — I(EQ)AGQ) — 1) = (EQ)? — D(E@A) - 1), 
meaning 
(3.91f3) h(x) = c3 E(x) + (cy —c3)E(—x) forx ER. 


(Note that if E(y)* = 1 for all y, then E(y) = E(—y), and then k = 0, which is not 
the case.) 
Now (3.95) yields 


(3.91f4) f(x) =b+ cc, — 2ec3 + cc3 E(x) — c(cy — c3)E(—x). 
Suppose f,4,k : R — C are continuous and satisfy (VS). Then 


k(x) = c(expdx — exp(—dx)), 
(3.97) h(x) = c3 expdx + (c1 — c3) exp(—dx), 
f(x) =b+ cc, — 2cc3 + cc3 expdx — c(c, — c3) exp(—dx). 
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If f,4,k : R > R are continuous and satisfy (VS), then 
(3.97a) k(x) =csindx or csinhdx. 
Substituting k(x) = c sindx in (3.96) gives 


h(y)c sindx — h(x)c sindy = cc, sind(x — y) 


= cc;(sindx cosdy — cosdx sindy) 


or 

sindy(h(x) — c, cosdx) = sindx(h(y) — c cosdy), 
yielding 
(3.97b) h(x) =c,cosdx + c4sindx. 


Using (3.97a) and (3.97b) in (VS), we get 


fa@+y)—f@-y) 


(c, cosdx + cq4sindx)c sindy 


= cc, cosdx sindy + cc4 sindx sindy 


= Ising +y)—sind(x — y)] 
+ Fleosd —y)-—cosd(x+ y)], 


yielding (compare Result 3.1a and Result 3.2a) 
(3.97c) f@= — sin dx - — 008 dix + +b. 


k(x) =csinhdx results in 


(3.97d) h(x) = c, cosh(dx) + ca sinh(dx), 


(3.97e) f@= > sin h(dx) — cos h(dx) 4: > +b. 


Thus we have proved the following theorem. 


Theorem 3.59. Suppose f,h,k : R > C satisfy (VS). Then f,h,k are given by 
(3.91) and (3.91a); (3.91c), (3.91d), and (3.91e); (3.91f), (3.91f)), and (3.91f2); and 
(3.91f), (3.91f3), and (3.91f4). Further, if f,h,k are continuous, then k,h, f are 
given by (3.91d}), (3.91d2), (3.91d3), (3.91d4), (3.91 g), and (3.97). 

If f,h,k : R — R are continuous and satisfy (VS), then f,h,k are given 
by (3.91b), (3.91d4), and (3.91g); (3.97a), (3.97b), and (3.97c); and (3.97d) and 
(3.97e). 
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3.4.10 Wilson’s Equations 
Finally, we consider the functional equations 


(3.98) S(x — y) = S(x)CG) — S(y)C&), 
(3.98a) C(x — y) = C(x)C(W) — k7S(x)S(y), 


where S,C : G — C, treated by Wilson in [580, 729, 12], and present a simple, 
direct proof of the following theorem. 


Theorem 3.60. Suppose S,C : G — C, where G is a group, satisfy (3.98) and 
(3.98a) with C satisfying (K), and k is a complex constant. Then S = 0, C = 0; 
S =0, C aneven homomorphism; when S 4 0 andC #0, S is odd and C is even; 
when k # 0, we have 


_ EG)+E(-x) 
=, 


(3.98b) C(x) S(x) = £5 (E(x) — E(-x)), 


where E is exponential, satisfying (E); and when k = 0 and G is 2-divisible, then 
(3.98c) C(x)=1, S(x)=A(x), 


where A is additive, satisfying (A). 


Proof. It is easy to see that S = 0, C = O is a solution. Suppose S = constant 
= C #0. Then (3.98) shows that C(x) = constant and then C = 0, which cannot 
be. Suppose S = 0 and C 0. Then (3.98a) shows that C(x — y) = C(x)C(y). 
Interchanging x and y shows that C is even and that C(x + y) = C(x)C(y); that is, 
C is ahomomorphism. So, we will consider only S 4 0 and C ¥ 0. First we prove 
that C satisfies (C). 

An interchange of x and y in (3.98) yields that S is odd and S(O) = 0 and in 
(3.98a) shows that C is even. 

Replace y by —y in (3.98a) to get 


C(x + y) = CR)CQ) + FS(x)S(), 
and adding this with (3.98a) gives 
C(x +y)+C@ —y) = 2C(%)C), 


which is (C). Then, by Theorem 3.21, there exists an exponential E : G > C* such 


that 
E(x) + E(-x) 


Ce).= 5 


Substituting this C(x) in (3.98a) gives 


1 
PS()S(y) = FE) — E(x) (EQ) - Ey) 
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or 
S(x) = d(E(x) — E(—x)) 


(when k 4 0) since S$ 4 0. Putting this S(x) in the equation above, we get d = 4 
when k 4 O and S(x) = +5(E(x) — E(—x)), giving (3.98b). 
Alternatively, replace y with —y in (3.98) to get 


NI 
>| 


(3.73) S(x + y) = SQx)C(y) + SQY)C(x). 
Define E : G > Cby 


E(x) = C(x) +kS(x) fork £0. 


E(x+y)=CO+y)+kS@ + y) 
= C(x)C(y) +k? S(x)S(y) + K(S(X)CO) + SO)C)) 
= (C(x) +kS(&x))(C(Y) + KSC) 
= E(x)E(y) 


shows that F is exponential. Then 
E(—x) = C(x) —kS(«), 


giving again (3.98b). 

When k = 0, we get C(x — y) = C(x)C(y) = C(y — x); that is, C is even, 
C(x + y) = C(x — y), C(x) = constant = d (here we use that G is 2-divisible), 
and d* = d, yielding d = 1. Hence C(x) = 1. Then (3.73) shows that S is additive, 
yielding (3.98c). This completes the proof of this theorem. 


3.4.11 Analytic Solutions 
In [701], the author says that to his surprise he found 
|sin(x +iy)| = |sinx +siniy| forx,y €R. 
It is also true that 
| sinh (x +iy)| =|sinhx+sinhiy| forx,y €R. 
From these arise the functional equation 
(3.99) If@ +iy)| =|lf@)+ fGy)| forx,y ER, 


and the following result holds. 
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Theorem 3.61. (H. Haruki [349, 350]). The only functions f : C — C regular 
(analytic) for |z| < y satisfying (3.99) are 


(3.99a) f(@) =az, f(z)=asinbz, or f(z) =asinhbz, 


where a is a complex constant and b is real. 


Proof. Suppose f : C > C satisfying the sine equation (S) is an entire function and 
J (z) is real on R. 

Then f is odd (ie., f(—z) = —f(z), z € C) and f(s+it) = f(s — it) 
fors,t ER. 

Now, 


If +in? = f(st+inf@ tif = f@ +it)f—it) 
by (S) = f(s)’ — fGt)? = f(s)? + f (it) f (-it) 
= f(s + fUNFED =F OP +1F GOP 


leads to the study of the Pythagorean functional equation (Hille [373], H. Haruki 
[348, 349]) 


(3.99b) lf@t+iyP =lf@OP+1FO)?, x.y ER. 


Equations similar to (3.99) are (3.99b), (3.99c), and (3.99d): 


(3.99c) If +iy? =f @O?+1fG)P -1, 
(3.99d) If tiy)] =|f@)+ fy), forx,y,R. 


Result 3.61a. [373, 348]. Suppose f : C > C is an entire function. If (i) f satisfies 
(3.99b), then f has the form (3.99a); (ii) if f is a solution of (3.99c), then f is given 
by f (z) = expi@ cos bz or expi@ cosh bz; and (iti) if f is a solution of (3.99d), then 
f is given by f (z) = az? or asin? bz or a sinh’ bz for z € C. 
Some Generalizations 
Suppose f, g, : C > C are entire functions satisfying the functional equation 
(3.99e) If(« +iy)| =lg@)| + |hG@y)| forx,y ER. 
It is easy to check that f satisfies the equation 
If +iy)|+1FOl=lf@I+1f@)| forx,y eR, 

the solutions of which are given by [373, 146, 349] 

f@)=ae—by fe) = (ae* +be“Y, 


where a,b, 1 € C, 2? € R. Thus, determining the solutions of (3.99e) boils down 
to solving the equations 
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(3.99e1) lg(x)| + |A(iy)| = la(z — b)?| 
and 
(3.99e2) |g(x)| + |ACy)| = lae*® + be~*2|2, 


where z=x+iy, x,yER. 
Let Q denote the set of all entire functions such that their power series expansions 
have real coefficients. The following results hold. 


Result 3.61b. (H. Haruki [355]). The only system of entire solutions g and h of 
equation (3.99e1) are 


gq) =a—soye?,  h(z) = a — ito)", 
where b = sy + ito and p,q,r € Qand q and r are odd and even, respectively. 


Result 3.61c. [355]. The only entire solutions of (3.99e2) in the case 1 € Rare 


g(z) = = arer2e ti 2) 
h(z) =0, 


when b = 0, and 


J 
g@) = io Bilaet Ol be AE), 
h(z) = 2O e1@tir@ qeAotd 4 pe~A60t D2, 


1h(z) 


when a #0, b £0, where p,qg,r € Q, gq andr are odd and even, respectively, 
so = (1/24) log |b/a|, to = 9/24, 9 € arg(—b/a), and I, Ji, In, Jz are given by 
the formulas 


00 : 
Z Z 
I =z" 1——)e%, 
sei) 
j=l , 
00 2 : 
Jx(z) = val I] (: - =) ed, dj E {e;, —Cj}. 
J 
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Theorem 3.62. P(z) and Q(z) are integral functions satisfying 

(3.100) P(zy’- Q(z)? =1, forz EC, 

if and only if there exists an analytic function f (Zz) with the property that 
(3.100a) P(z) =cosh f(z), Q(z) =sinh f(z), zeEC. 


Proof. (See also Ganapathy [298].) Note that (3.100a) includes the familiar solutions 
(cosh z, sinh z). 
Let (3.100) hold. Then 


PQ) — O@) = [P@) + O@IIP®) — O@1=1. 


Hence P(z) + Q(z) and P(z) — Q(z) are integral functions without zeros. Hence 
there exists an integral function f(z) such that 


P(z) + Q(z) =exp f(z) and P(z) — Q(z) = exp(—f(Z)). 
Thus (3.100a) holds. This completes the proof of the theorem. 


Theorem 3.62a. If the solutions (3.100a) of (3.100) are periodic, then the same is 
true of f'(z). Conversely, if g(z) is an integral function with g'(z) periodic, then 
there is a constant « such that wg(z) put in (3.100a) for f(z) gives a periodic solu- 
tion of (3.100). 


Proof. Suppose the solutions (3.100a) of (3.100) are periodic with period 1. Then 
we have 
cosh f(z + 2) — cosh f (z) = 0; 


that is, 
exp f(z +4) +exp(—f(z + 2)) — exp f(z) — exp(—f (z)) = 0, 


meaning 


[exp {f(z + 4) — f(z)} — 1]lexp f(z) — exp(—f@ + 4))] = 9, 
from which it follows immediately that, for every complex z, either 
f(z +A) — f(z) = 2kai 
or 
SHA + f(]@) = 2zi, 


where k and f are integers. Since f is an integral function, we can conclude that 
either the former is true at all z or the latter holds at all z. If the former holds for all 
z, then we have 


F@+A)— f'@) =0, 


from which we see that f’(z) is periodic with period A. 
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In the latter case, we get 
FE+AD+F@O=0 0 f/e+2W=f/'@, 


with the result that f’(z) is periodic with period 21. In any case, we see that f’(z) is 
periodic. 
Now let g(z) be an integral function with g’(z) periodic, say with period 2. Then 


g(z +A) — g(z) =c constant. 


If c = 0, taking uw = 1, f(z) = g(z) in (3.100a) gives a periodic solution of (3.100). 
Ifc 40, take w = 2k , where k is an integer (4 0). Then f(z) = wg(z) in (3.100a) 
gives a periodic solution of (3.100), which can be easily verified from the identities 


CHD 


D 
cosh C — cosh D = 2 sinh CF sinh 


and 


D 
sinh C — sinh D = 2cosh cs 


This completes the proof of this theorem. 


Theorem 3.62b. /f the solutions in (3.100a) of (3.100) are of finite order, then f (z) 
in (3.100a) is a polynomial in z. If, further, P(z) and Q(z) are periodic solutions, 
then f (z) is linear in z so that P(z) = cosh (az + b) and Q(z) = sinh (az + b). 


Proof. If P(z) and Q(z) are of finite order, then so is P(z) + Q(z). Hence, from 
P(z) + O(z) = exp(+f(z)), we see that f(z) is a polynomial. If, further, P(z) 
and Q(z) are periodic solutions, from Theorem 3.62a we have that f’(z) is periodic. 
Hence f’(z) is a constant and so f(z) is linear in z. Thus the proof of this theorem 
is complete. 


Theorem 3.62c. There exist no integral functions P(z) and Q(z) satisfying the equa- 
tion 


(3.100b) P(z)" — O(z)" =1 


where n is any integer > 3, except when both P(z) and Q(z) are constants. 


2 
eee 


Proof. Let w = exp at and 1, a, a’, a”! be the nth roots of unity. Then 


P(z)" — O(z)" =[P@) + wOG)I[P@ + awQ)]...[P@) +a" 'wO)] 
= 1. 
If P(z) and Q(z) are integral functions satisfying the conditions above, then 


P(z) +wOQ(z),..., P(z) +a"! wO(z) 


P(z) 


Oz) cannot 


are integral functions having no zeros. Thus the meromorphic function 
n—-1 w 


take the n values —w, —aw,...,-—@ 
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But, by Picard’s Theorem, ne : can omit at most two aie unless ae ; is con- 


a + is a constant, from 
G: 100b) ¢ iS evident that P(z) and Q(z) are constants. This completes the proof of 
this theorem. 


3.4.12 Equation (3.72) on Analytic Functions 


Now we determine the continuous linear functionals F and K on H(S), the algebra 
of analytic functionals on an open set S C C satisfying the functional equation 
(3.101) F(fg)= F(P)F(g) -K(P)K(g)_ for f,g € H(S). 


Theorem 3.63. (Kannappan and Nandakumar [488]). Let F and K be nontrivial 
linear functionals on H(S) satisfying the functional equation (3.101). Then one of 
the following is true: 


(i) If K(1) £ 0, there exist constants k\, k2, homomorphism N, and z, € S such 
that 
F=k\N, K=kN; 


Lé., 
K(f)=hf@), FP)=mf), for f ¢ H(S). 
(ii) If K(1) = 0, FC) = 1, and K(1) # 0, then K and F satisfy the functional 
equation 


(3.10la) K(fg) = K(f)F(g)+ K(g)F(f) +cK(f)K(g), f,g € H(S), 
and there exist a z; € S and a constant c with c2 = 4 such that 


KN=KoOre, FN=fen-*PPen, for fe HO), 


(ili) there exist z1,z2 € S, z1 4 Z2, and a constant c # +2 with 1 + i 


K (1)? = 0 such that 


Kes a -(F(@1) = FMI 


— 


F(f)= stra) + f@2)] +5 a meACE — fG2)], for f € H(S). 


Proof. In the following, we need the Result 3.63a in Rubel [714], Zaleman [838], 
and Nandakumar [643], and some auxiliary results. 


Result 3.63a. If K and F are two linear functionals satisfying the functional equa- 
tion 


(3.101b) K(fg) = K(f)F(g) + F)K(g) on H(S), 
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then one of the following is true: 


(a) There exists z; € S such that 


KN=KMFED), FN=5fEd, fe HO): 
(b) there exists z; € S such that 
K(f)=K()f'(i), F(f)=fi), f € HS); 
(c) or there exist z1, Z2 € S with z1 4 z2 such that 
K(f) = AO re — f@2)], Ff) = strlen + f@2)], f € HGS). 
From now on, we assume that the linear functionals K and F satisfy (3.101). 
Lemma 3.63b. Let F and K be nontrivial linear functionals on H (S) satisfying the 
functional equation (3.101) and let K (1) 4 0. Then F and K are constant multiples 


of a homomorphism; i.e., there exist constants k; and kz and a homomorphism N 
such that F = kN and K = kxN with ky = kf — k3. 


Proof. Setting f = g = 1 in (3.101), we get 
F(1) = F(1)? — K(1)’. 


Suppose K (1) 4 0. Then F(1) is neither 0 nor 1. Hence, taking g = 1 in (3.101), 
we obtain 

F(f) = F(f)F() — K(P)K() 
so that 


F(f) =kK(f), 


Far # 0. Substituting this relation in (3.101), we obtain 


kK (fg) =k°K(f)K(g) — K(f)K(g) = (k? — 1)K(f)K(g) 


where k = 


or N = ae) is ahomomorphism. Note that k ¢ 0, +1; otherwise K is a trivial 
linear functional. Thus K and F are constant multiples of a homomorphism. 


Lemma 3.63c. Let F and K be nontrivial linear functionals on H (S) satisfying the 
functional equation (3.101). If K (1) = 0, then F(1) 4 0. 
Proof. Suppose F (1) = 0. Then 

F(f)=FQ-f)=FO)FY)—-KQ@)K(f) =0 forall f ¢ H(S), 
which contradicts the hypothesis that F is nontrivial. Hence F(1) 4 0. 


Lemma 3.63d. Let F and K be linear functionals on H (S) satisfying the functional 
equation (3.101). If K (1) = 0, F(1) = 1, and K (1) = 0, then F is a homomorphism 
and K is a trivial linear functional. 
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Proof. Let F(J) = zo. We claim that zo € S. Suppose not. Then ss € H(S). 
Then, since K (1) = 0, F(1) = 1, and K (J) = 0, we have by linearity of F and K 
that 


1 1 I 
1=F(U-2) 7) = FU —z0)F (—) - KU =k (——) 


= (FU) — F(Z0))F (—) — (K(1) — K(z0))K (—) =0, 
— 20 I —zo0 


which is a contradiction. Thus zg € S. 


Since zo € S, we have Lf) € H(S). Hence 


F(f) — f@o) = FY — f@o)) 


_ ole of — £0) 
=F (¢-0G5) 
2 P= aoyk (EY) -Ka-ak (2) 
I—2z0 I-—zo 

= (FD) — Fo) F (i) 

— 20 

— (KW) ~ K@o))K (+ Po) =0. 
— 20 


Thus F(f) = f (Zo), which is ahomomorphism. This in turn implies K = 0. 


Lemma 3.63e. Let F and K be nontrivial linear functionals on H (S) satisfying the 
functional equation (3.101). If K(1) = 0, FC) = 1, and K (1) # 0, then F and K 
satisfy the equation 


(3.10la) K(fg)=K(f)F(g)+ K(g)F(f)+cK(f)K(g), f,g € H(S), 
where c is a constant. 


Proof. From F(fgh) = F(fhg), and applying (3.101) a couple of times, we get 


F(fgh) = F(fg)F(h) — K(fg)K (A) 
=[F(f)F(s) — K(f)K(g)IF) — K (fg) K (h) 


and 


F(fhg) = F(fh)F(g) — K(fh)K(g) 
= [F(/)F(h) — K(f)K (A) F (8) — K (fh) K (8). 


Comparing the two equations above, we obtain 


[K (fg) — K(f) F(g)]K (h) = [K (fh) — K(f) F(A)IK (8). 
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Setting h = J and using K (/) 4 0, we have 


(3.102) K(fg) = K(f)F(g) + N(f)K (a), 


where N(f) = K(1)~'![K ([f) — K )K (f)]. Interchanging f and g in the preced- 
ing equation, we get 


(3.102a) K(f)(N(g) — F(g)) = K(g)(N(f) — FY). 


Replacing g by J in (3.102a) and solving for N(f) using K(J) 4 0, we obtain 
N(f) = F(f) + cK (f), where c = K(1)~'[N() — F()]. Using this in (3.102), 
we have 


(3.101a) K(fg) = K(f)F(g) + Ke) F(f) + cK (f)K (8). 

Lemma 3.63f. [714]. [f F is a homomorphism on H(S), then there exists azo € S 
such that F(f) = f (zo) for all f € H(S), hence the result. 

Proof of the Theorem 

Proof. (For another proof, see [488].) 


Case 1. K(1) 4 0. By Lemma 3.63b, there exists a homomorphism WN such that 
F = kiN and K = k2N. Since by Result 3.63a there exists a z} € S such that 
N(f) = f (21) for all f € AH(S), the result (i) follows. 

If K(1) = 0, then F(1) = F(1)* — K(1), which implies that F(1) = 0 or 
F(1) = 1. But, by Lemma 3.63c, F(1) 4 0. Thus we have the following two cases 
when K (1) = 0. 


Case 2. K(1) = 0, FC.) = 1, and K(J) = 0. From Lemma 3.634, this is not 
possible since this leads to K = 0. 


Case 3. K(1) = 0, F(1) = 1, and K(/) 4 0. By Lemma 3.63e, we have 
(3.10la) K(fg)=K(f)F(g)+ K(@)F(f) +cK(f)K(g), fig € H(S), 
where c is a constant. Rewriting the above, we obtain 

K (fg) = K(ALF(@) + 5K@N+K(@) [FN + 5K], 
which is of the form (3.101b). Since K (1) 4 0, solution (a) of Result 3.63a is not 
possible. Now we apply solution (b) of Result 3.63a. Thus there exists a z; € S such 


that 


K(f) =K()f'@1), F(f)+5K(f) = f@ for f € H(S); 
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that is, 
K(f)=K()f'@), FS)=fe)- SKS C1); forf € H(S), 


which is (ii) of the theorem. This K and F satisfy (3.101b), provided c* = 4. Note 
that K (1) = 0 and F(1) = 1 are satisfied and the solutions are 


K(f)=KMf'@), FA=fEeDEKMS' 1), for f € A(S). 
Finally, corresponding to solution (c), there exists z1, z2 € S with z; € z2 such that 


RAS ep FON: 
a inmate 8) 


1 
BO SKA) = 5lf@i + f@2)], for fe H(S); 


that is, 


K(1) 
K(f)= noe — f (z2)I, 


— 22 
KW) 
i= 


1 
F(f) = sf Gi) + F@2)] — 5 (fi) — f@2)], for f « H(S), 


£2 


which is the second part of (iii) of the theorem. These F and K satisfy (3.101b) 
when 


= 22 
1+——— K(1)? =0, where c # +2. 
(i= 22) 
Ifc =0, KU y = — Misa)" and the solution takes the form 


| 1 
K(f)= +51F 1) — FG) FO) = slf@i) + f@2)I- 


This completes the proof of the theorem. 


Remarks 


1. The solution of the equation F (fg) = F(f)F(g)+ K (f)K (g) can be obtained 
from (3.101b) by replacing K byikK. 

2. The solution of K(fg) = K(f) F(g) — F(f)K(g) is K = 0 and F arbitrary 
(interchange f and g). 

3. We have shown that the linear functionals satisfying (3.101b) are linear combi- 
nations of evaluation functionals, which implies that these functionals are con- 
tinuous on H(S) equipped with the topology of uniform convergence on com- 
pact subsets of S. Hence we ask whether we obtain additional solutions if the 
requirement of linearity of the functionals is weakened to additivity. 
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3.4.13 Some Generalizations 


In this section, we treat many generalizations. We start off with the general functional 
equation 


(3.103) f@ty)+e—-—y)=h(x)k(y) forx,y €G, 


where f, g,h,k : G > Cand Gis a 2-divisible Abelian group. 
First we will show that the study of (3.103) is equivalent to the study of 


(3.103a) F(x +y)N(« — y) = A(x)4+ K(y) forx,y eG, 


where F,N,H,K:G—>C. 
Putting x + y = 2u and x — y = 2b first in (3.103) results in 


h(u+v)k(u —») = fu) +gQv), 
which is (3.103a), and second in (3.103a) it goes over to 
Htu+v)+ K(u—v) = FQu)N(2v), 


which is (3.103). 

So, we will consider only (3.103) (see Wilson [831]). Special cases of (3.103) 
have been discussed elsewhere in this book and in the literature. Some of the more 
familiar special cases include Jensen’s equation (J) (g = h = f,k = 2), the Pexider- 
Cauchy equations (g = 0,h = k = f), the cosine equation (C) (g = f,h = f, 
k = 2f), the generalized cosine equation (3.27) (g = f,h = f,k = 2), and the 
vibration of strings equation (VS) (g = —f), among others. 

As a matter of fact, we determine the most general solutions of (3.103) in the 
following theorem connected to (3.27) and (VS) and the usual Cauchy equations (A) 
and (E). 


Theorem 3.64. (See also [831].) Suppose f, g,h,k : G > C (with G, a 2-divisible 
Abelian group) fulfill equation (3.103). Then they are given by: 


G) h=O0ork=0, f=c,g=-c. 
(ii) Case (i). h = constant c (4 0), k # 0. 


2 
F(%) = A(X) +1, 8) = —A(x) + €2, kx) = ~ AG) +3, 


with cy + c2 = €c3. 


Case (ii). k = c (4 0), h £0. 


2 
f(x)=AQ), g(x) =A)4+cc1, AG)= ~AG@) +c,. 
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(iii) Case (i). f =0, g £0, A,k ZO. 
g(x) =beE(x), h(x) =bE(x), k(x) =cE(-x). 
Case (ii). g =0, fh, k £0. 
f(x) =bcE(x), h(x) =bE(R), k(x) =cE(2). 


(iv) f, 9,4, k are nonconstant. 

Case (i). 4(0) = 0 or k(O) = 0 reduces to (VS) and 

(a) f(x) =b+ SA)? = —g(—x), h(x) = cA(a), k(x) = AQ), 
f(x) =b + cd*(E(x) + E(—x) — 2) = —g(-x), 

h(x) = ck(x) = cd(E(x) — E(—x)), 

fx) =b+ FAQ) + ZAC) = —8(—x), h(x) = AG), 
k(x) =c1 + c2A(x), 

f(x) =d+cc3E(x) — c(c, — c3)E(—x) = —g(—x), 
h(x) = c(E(x) — E(—x)), 

k(x) = c3E(x) + (c1 — c3)E(—x); 

(b) f(x) =b+ FAQ)? = —g(x), A(x) = A(X), k(x) = cA), 

F(x) =b + cd2(E(x) + E(-x) — 2) = -8@), 

k(x) = ch(x) = cd(E(x) — E(—x)), 

fe) =b+ FAX) + ZAC) = —g (x), A(X) = c1 +: c2A(a), 
k(x) = A(x), 

f(x) =d +cc3E(x) — c(c1 — c3)E(—x) = —g (x), 

h(x) = c3E(x) + (c1 — c3)E(—x), 

k(x) = c(E(x) — E(-x)). 

Case (ii). (0) £0, k(O) £0. 

(a) f(x) = Go2A(x)? + c4A(x) +5, h(x) = A(X) +b, 

BQ) = —47e2A@)? + CoA) $07, k(x) = 2A) +03, 
with 2c4 = c3 + bea, 2c6 = c5 — ber, c5 +¢7 = bo3; 

(b) f@&) =cio(E@) + E(—x)) + c11(E(@) — E(-x)) +65, 
g(x) = c13(E(x) + E(—x)) + c12(E(x) — E(—x)) +.€7, 
h(x) = $(E(a) + E(—x)) + (E(x) — E(-x)), 

k(x) = cg(E(x) + E(—x)) + co(E(x) — E(—x)), 
with cj9 = “ss + be4, cy, = “< + beg, c13 = Acs — beg, 


C12 = beg — i, c5 +c7 = 0, where A is additive satisfies (A), E satisfies 
(E), and b,c, d, and c, to c\3 are constants. 


Proof. We have to consider several cases. 
Solution (i) is obvious. Suppose h = c 4 0, k € 0. Equation (3.103) becomes 


fa+y)+s@—y)=ck) forx,yeG. 
With x = y, we get f(2y) + g(0) = ck(y), so that we have 


f@+y)+g@—y)=f@y)+e(0) forx,y eG. 
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Setx + y =u, x — y =v to obtain 


flu) + gv) = ftu—v) + gO), 
£0) + gv) = f(-v) + 8), 


and 


flu) + f(-v) = ftu— 0) + FO), 


which with A(x) = f(x)— f (0) yields A(u—v) = A(u)+A(—v), which is additive. 
This solution results in (ii) case (i) (and could have used the Pexider equation, too). 
As for (ii) case (11), assume k = c £0, h 40. Then 


f@+y)+ e@ — y) = ch); 


that is, f(x + y) + g(x — y) = f (2x) + g(O) (set y = x in the equation above ). 
As before, the equation reduces to 


flu) + gv) = fut+v) +g), 


which is a Pexider equation, and 
fu)=AW), gtu)=Al)+g0), ch) = 2A(x) + 8(0). 


This gives the solution (ii) case (ii). 
Let us now consider the case f = 0, g, hh, k #0. Then (3.103) takes the form 


g(x —y)=hA(x)k(Qy) forx,y €G. 
This is a Pexider equation. With x = 0 and y = 0, separately we have 


g(—y) =hO)k(y), g(x) = A(x)k(O), 
so that 


1 
h(O)k(0) 
(h(0) = O or k(O) = O imply g = O, which is not the case). Now g(x) = 
dE(x), where E satisfies (E). This yields the solution (iti) case (i). Similarly, 
g =0, f,h,k 4 0 gives the solution case (11) of (iii). 

Finally, we treat the case where /f, g,,k are not constants. We will show how 
(VS) and (3.27) arise naturally to determine the solution (iv). 

First we consider the case (0) = 0 or k(O) = 0. Suppose A(0) = 0. Then x = 0 
in (3.103) gives 


g(x — y) = g(x)g(-y) 


f(y) + g(-y) =0 
and 
fQ +x) — f(y — x) =h&)k), 
which is (VS). By Theorem 3.59, this gives the solution (iv) case (i) (a). 
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When k(0) = 0, y = 0 in (3.103) shows that 
f(x) + g(x) =0 


and 
fe +y)— f@—y)=AG)kO) forx,y EG. 

This is (VS), and Theorem 3.59 yields the solution (iv) case (1) (b). 

Lastly, we consider the subcase (ii), 4(0) 4 0, k(O) 4 0. Set y = x and y = —x 
separately in (3.103) to get 
(3.103b) f (2x) = h(x)k(x) +5, 
(3.103c) g(2x) =hA(x)k(—x) +7, forx eG. 

From (3.103), (3.103b), and (3.103c), letting x + y = 2u, x — y = 2v (using that 
G is 2-divisible), we have 
(3.104) hlu+ov)k(u—v) = fu) + g(2v) 

=h(u)k(u) +h(vo)k(—v) +c5 +c7. foru,v € G. 


Let v = u and v = —u in (3.104) separately to have 
(3.105) h(2u)k(0) = h(u)(k(u) + k(—u)) + ¢5 +7, 
(3.105a) k(2u)h(O) = k(u)(h(u) +h(—u)) +e5+c¢7,  forue G. 


Interchange u and v in (3.104) to obtain 
(3.104a) hA(u+ov)k(v —u) =hA(v)k(v) +h(u)k(—u) +¢5 +¢7 foru,v €G. 
First add (3.104) and (3.104a) and use (3.105) and (3.105a) to get 
h(u+v)2e(u — v)k(O) = h(u)2e(u) + h(v)2g(v) — 2c5 — 2c7 
= h(2Qu)k(0) + h(2v)k(0), 
where 
(3.106) 20(x)k(O) = k(x) +k(—x) forx €G. 
Setting u + v =x andu —v = y, using k(0) 4 0, we have 
h(x+y)+h@ —y)=2h(x)p(y) forx,y €G, 
which is (3.27). From Theorem 3.41, it follows that the nonzero solutions are 
(3.107) g =1, his Jensen (J); that is, h(x) = A(x) +B, 


_ Ke) +k) _ EQ) + EC) 


p(x) 2K (0) 5 : 


(3.107a) F 
h(x) = 7 (EG) + E(—x)) + D(E(x) — E(-x)), 


where A satisfies (A) and E satisfies (E). 
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First let us consider the case (3.107a) h(x) = A(x) + b and obtain the solu- 
tion (iv) case (11) (a). 
Subtracting (3.104a) from (3.104) and utilizing (3.106) with g = 1, we get 


A(u + v)(k(u — v) — k(0)) = ACU) (Ku) — K(O)) — Av) (K(v) — K(O)) 
(cf. (3.95a)). Replace u by u + v in the equation above to obtain 
a(u) + 2A(v)(k(u) — k(O)) = atu + v) — a(v), 


where a(x) = A(x)(k(x) — k(0)) for x € G. 
Interchanging u and v yields 


A(v)(k(u) — k(0)) = AW)(K(0o) — k(0)); 


that is, 
ku) —k(0)=b,A(u) forue G, 


where b, is constant. 

These values of h and k in (3.104) and (3.104a) give f and g and produce the 
solution (iv) case (11) (a). 

Finally, we tackle (iv) case (ii) (b). Now (3.107a) holds. 

Subtracting (3.104a) from (3.104), we get 


(3.108) h(u+ov)atu—v)=h(ujatu) —h(v)a(v), u,v € G, 


where a(u) = k(u) — k(—u) for u € G is odd (cf. (3.95a) and (3.95)). 
Change v to —o in (3.108) and use that @ is odd to obtain 


h(u + v)a(u — v)—h(u — vya(u + v) = —a(v)(h(v) + A(—v)) 
= —h(O)a(2v) (put u = v on the left side), 


which by letting vu + 0 = x and u — v = y becomes 
A(O)a(x — y) = —h(x)a(y) +a@)h(y) forx, y €G, 
which is (3.73a). Change y to —y and add to have 
AO) +y) +a —y) =a@)hty) +h(-y)), 


which is (3.27) with 29(y) = ho EME») | 
We know from (3.107a) that c= E(y)+ E(-y), so we need only a(x) from 
Theorem 3.21: 


ale) = k(x) — k(x) = LEC) + E(x) + bE) ~ E(-9)). 
From a(x) and (3.106) with (3.107a), we get k(x). Then (3.103b) and (3.103c) 


determine f and g. This solution results in (iv) case (ii) (b). This completes the 
proof of this theorem. 
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3.4.14 Levi-Civita Functional Equation, Convolution Type Functional 
Equations, and Generalization of Cauchy-Pexider Type and d’Alembert 
Equations 


The object now is to discuss the most general functional equations 


n 
(3.109) f(x +y) = Do Aj(x)ki(x) — (Levi-Civitd equation (Székelyhidi [794])) 


i=1 


and 

(3.109a) fe +y)t+ex—y)= > hiWKGO) forx,y €G, 
i=1 

(3.109b) fatyg@—y)= SAK) forx,y €G, 


where f, g, hj, kj (i = 1 to n) are complex-valued functions defined on a group G. 
Equations of the form (3.109), (3.109a), and (3.109b) have been studied by many 
authors in [44, 9, 182, 630, 631, 551, 594, 596, 717, 796, 653, 295, 790, 747, 787, 
761, 794, 818], etc. The equation (3.109) is a common generalization of Cauchy- 
Pexider (additive, exponential) equations, whereas equation (3.109a) includes the 
above and more—Jensen, cosine, sine, (VS), (3.102), (Q), square norm equations, 
etc. 

The functional equation (3.109), known as the Levi-Civita equation (Székelyhidi 
[787]), has been studied by many authors under various assumptions, such as dif- 
ferentiability, measurability, etc. The earliest general results concerning (3.109) are 
due to Vincze [818]. He introduced the method of determinants to solve special 
cases of (3.109). In McKiernan [630], it was shown that (3.109) is closely related to 
the matrix equation (E) and can be solved by simultaneous diagonalizations of the 
matrix E(x), and the solutions can be expressed in terms of additive and exponen- 
tial functions where the additive functions satisfy additional functional equations. 
Also McKiernan [630, 631] obtained the general solution of (3.109) under some 
conditions, and in Sinopoulos [747] it is shown that the solution is an exponential 
polynomial. 

In Székelyhidi [797], it was shown that (3.109a) has significant applications 
in signal processing. Notice that the left side of (3.109a) is the general solu- 
tion to the wave equation, while the right side is the inner product of vectors 
h(x) = (hi (x),...,4n(x)), K(x) = (ki), ..-,kn(@)). We will consider this in- 
ner product form a little later. 

In Rukhin [717], the author obtains conditions for the solutions of (3.109a) to 
be matrix elements of a finite-dimensional representation of the group. Whereas in 
Székelyhidi [790] the author tackles the problem of determining the general solution 
of (3.109a) raised by Aczél in [9], Székelyhidi and Aczél obtain the solution of 
(3.109a) by reducing it to matrix equation (E) and (C) as exponential polynomials 
(see inner product version). Here are some interesting results. 

Let M,,(C) denote all n x n matrices with complex entries. 


3.4 Solution of (CE) on a Non-Abelian Group 203 


Result 3.65. [747, 630, 631]. Let G be a topological Abelian group and f, hi, k; 
G—> CG =1,2,...,n) be continuous solutions of (3.109). Then f is an expo- 
nential polynomial. 


Result 3.65a. [787]. Let G be an Abelian group, n a positive integer, and f, kj, hj : 
G => C be functions (i = 1,2,...,n) where both the sets (ki, k2,...,kn) and 
(h1,h2,...,hn) are linearly independent. The functions f,kj,hj @ = 1,2,...,n) 
form a nondegenerate solution of (3.109) if and only if there exist positive in- 
tegers k,nj,n2,...,nk with ny +na+... + ne = n, different exponentials 
E\, E2,..., Ex, and for every j = 1,2,...,k there exist linearly independent sets 
(Aji, Aj2,--.,Ajnj;-1) Of additive functions and polynomials Pj, Qi j, Rij @ = 
1,2,...,n) inn; — 1 complex variables and of degree at most nj; — 1 such that we 
have for all x in G 


fQ)= 3 P;(Aji(x), Ajo(x), ..., Ajnj—1(®)) Ej), 


j=l 


ki(x) = > Qi; (Aji(x), Ajo), .. «5 Ajnj—1 8) Ej), 


j=l 


hj(x) = y Rij(Aji(x), Aja(), ..., Ajnj-1@)) Ey @). 


j=l 


Result 3.66. [790]. Let G be a topological group and f : G — M,(C) be a 
continuous solution of (C) with f(0) = I, and f (xo)? — I is invertible for some 
x0 € G. Then, there isan E: G + M,C) satisfying (E) such that 


f@)= SIE) + EC x)] 


holds for all x € G (see Kannappan [424]). 
The proof follows the method in [424]. 
Result 3.66a. [790]. Let G be a 2-divisible topological Abelian group and f : G > 


M,,(C) be a continuous function satisfying (C). If f (0) = I, then every component 
of f is an exponential polynomial. 


Result 3.67. [790]. Let G be a 2-divisible, locally compact Abelian group and 
St, g hi, ki: G — C be measurable solutions of (3.109a). Then f and g are expo- 
nential polynomials. 


3.4.15 Operator-Valued Solution of (C) 


As mentioned earlier, the d’Alembert functional equation (C) has a long history, 
studied under different conditions, with a rich literature. Also, the operator-valued 
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versions have been studied by several authors, including [181, 621, 421, 282, 796]. 
Let L(H) denote the set of all bounded linear operators in a Hilbert space H and 
N(#) all normal operators on H. 

A mapping f : G > L(#) satisfying the equation (C) with f(0) = J is called 
a cosine operator function, and that of f : G — N(H) satisfying (C) is called a 
normal cosine operator function where G is (an Abelian) a group. 


Result 3.68. (Stackel [763]). Let G be an Abelian group and H be a Hilbert space. 
Suppose f : G — N(A) is anormal cosine operator satisfying (C). Then there is a 
homomorphism (exponential) E : G — N(HA) (multiplicative semigroup) such that 


(3.110) f@)= 5(EO) + E(—x)) 


holds for all x € G (see [424]); that is, f is an even part of a homomorphism of G 
into the multiplicative semigroup of N(H). 


Cosine operator functions have also been studied on non-Abelian groups, where 
usually the commutativity of G is replaced by the condition (K) (see [424]). 


Result 3.68a. [763]. Let G be a group and H a Hilbert space. Then any operator 
f :G—> N(A) satisfying (C) with (K) has the form (3.110). 


Corollary 3.68b. Let G be a group, Ho an inner product space, H its completion, 
and f : G + N(Apo) 4a solution of (C) with (K). Then f has the form (3.110), where 
E :G—= N(A) is a homomorphism. 


In applications, we frequently encountered many specific cases of 
equation (3.109a); for example, Jensen, quadratic, d’ Alembert, or cosine, and Wilson 
(see [12, 40, 398, 424, 830, 831]). The general solutions of these equations are 
usually obtained by solving them using procedures tailored to the particular equa- 
tion; that is, there is no universal scheme to cover many of them at the same time. To 
extract the solution to these special cases from [717, 790] is not simple. The frequent 
occurrences of these and similar equations warrants the listing of general solutions 
to special equations of lower order n. With this in mind, we now treat the functional 
equation 


(3.111) fa +y)+ f&@—y) = f@)s) + s@QFY) 

in Result 3.69a. In the process, we solve some key special cases and encounter some 
conditional Cauchy equations, which is an area that has received much attention 
lately (see [218, 422]). 

3.4.16 Solution of Equation (3.111) 


Result 3.69. Suppose E : G — C* is exponential. Let f : G + C be a mapping 
satisfying the functional equation 


(3.112) faety)+ ff —y) = (E@) + Ex) FO) + F@)EO) + E(-y)) 
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forx,y € Gwith 


(K) fixty+z=fe+z+y) forallx,y,z€G. 
Then f must be of the form 
f(x) = A(x)(E(x) + E(—x)) when E(x) # E(—x) 
or 
f (x) = Eo(x)B(x,x) when E(x) = E(—x) = Eo(x) (say) 


and A: G — Cis additive and B : G x G > C is biadditive. 
The converse is also true. 


Result 3.69a. Suppose f (4 0), g : G > C are mappings satisfying the functional 
equation (3.111) with (K). Then they have one of the following forms: 

f(x) = A@)(E(x) — E(—x)), when E(x) # E(—x), 

g(x) = E(x) + E(-x), 

F(x) = Eo(x)BQx, x), when E(x) = E(-x) = E(x) (say), 

8 (x) = 2Eo(x), 


f(x) = ylEi(x) + E1(—x) — E2(x) — E2(-x)], k £0, 
g(x) = S[E (x) + Ey (—x) + Eo(x) + Ex(-x)], £1 # E2, 


where A is additive, Ey : G > C, E, Ej : G — C* are exponential, and B : 
G x G > Cis biadditive. Conversely, for nonzero A and B, f and g given above 
satisfy (3.111) with nonzero f. 


3.4.17 Inner Product Version of (3.109) 
First of all, (3.109) can be rewritten as 
f(x +y) = (h@), ky) forx,y €G, 


where h(x) = (hi (x),...,4n(x)), k(x) = (k(x), ...,kn(&)), and (, ) is the stan- 
dard inner product in C”, where f, h, k are complex-valued functions on G. 

In O’Connor [653], the author determined the continuous solutions of the func- 
tional equation 


n 


(3.109c) fx—y) => lac(x)ar(y) forx,y € G, 
k=1 


where f, ax : G > Cas 


f(x) = Do be A(x), 


k=] 
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where 4; are continuous characters of G and by are constants. This result was 
obtained by means of the Bochner representation theorem for positive definite func- 
tions. In Gajda [295], the author found the forms of a;s5 also. Equation (3.109c) can 
be put in the form 


(3.109d) f(x —y) = (atx), a(y)) forx,y EG, 


where a(x) = (a1 (x),...,@n(x)) is a mapping a : G > C”. If C” is replaced by a 
Hilbert space H over C, solutions of (3.109d) can be described in terms of a unitary 
representation of G. 

Recall that a homomorphism g : G — U(#) (all unitary operators on H) is 
called a unitary representation of a topological group G. 

A unitary representation U is said to be continuous provided, that for every 
s € H, the transformation 6 : G — AH given by f(x) = U(x)s, for x € G, is 
continuous. 

An element so € H is called a cyclic vector of a representation U provided 


c€ span{U(x)s:x €G}=H, 


where cf stands for the closure operator in H and span{S} means the linear space 
generated by a subset S of H. 


Result 3.70. [295]. Let G be a topological group and H be a Hilbert space over C. 
Suppose that f : G + Canda: G — H are continuous functions. Then f and 
a satisfy (3.109d) if and only if there exists a continuous unitary representation U 
of the group G in the space Ho := c€ span{a(G)} with the cyclic vector so = a(0) 
such that 


a(x) = U(x)so, xeG, 
f(x) = (U(x)s0, 50), x €G. 


Let Minn (C) denote the set of all m x n matrices over C and 6; ;, the Kronecker 
delta given by 


a 0, fori £ j, 
1, fori =j. 


Result 3.71a. [295]. Let G be an Abelian topological group. Then continuous func- 
tions f,aj:G—> C(j =1,...,n) satisfy (3.109d) if and only if there exist contin- 
uous characters X,..., Xm of the group G (m <n), complex numbers b1,..., bm, 
and a matrix B € Mmy(C) such that 


n 
> i iPej = On a c— e , 


j=l 
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f@) =i bP), x €G, 
i=l 


m 


aj(x) = > diBjilx), forx éG, j=H=1,...,n. 
i=1 


3.4.18 A Functional Equation of d’Alembert’s Type 


We will now consider d’Alembert’s equation in the general form (inner product 
version) [295] 


(3.113) f@+y)+ fx —y) =2(ax),a(y)) forx,y €G, 


where f :G— C,a:G— H, Ga group, and H a Hilbert space over C. 


Theorem 3.71. (See also [295].) Let G be a topological group (not necessarily 
Abelian) and H a Hilbert space over C. Suppose f : G > Canda:G—> H 
are continuous solutions of equation (3.113) with f satisfying (K) and there exists a 
zo € G such that a(zo) = 0. Then 


(3.113a) f(x) = (a(x), a(0)) forx €G, 


and there exists a continuous unitary representation U of G in Ho = c€ span{a(G)} 
with the cyclic vector sy = a(0) such that 


(3.113b) a(x) = 5(U(x)s0 + U(—x)so), x €EG, 
and so 
(3.113c) f@ms= 5(U(x)so + U(—x)so, 50), x EG. 


Conversely, if f : G+ Canda:G — H are given by (3.113b) and (3.113c) with 
some unitary representation U and cyclic vector so, then equation (3.113) holds. 


Proof. The converse is easy to check. Assume that (3.113) holds. Set y = 0 (identity 
in G) in (3.113) to get 


(3.113a) f(x) = (a(x),a()) forx eG. 


Note that no continuity is used. 
Changing y to —y in (3.113), we have 


(a(x), a(y)) = (a), a(-y)) forx,y €G, 


which implies that a(—y) = a(y) for y € G. So, a is even and then, from (3.113a), 


fx) = f(Ex),x €G. 
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Now, set x = 0 in (3.113) to obtain 


f(y) = (a), a) = (a), @)) = FO). 


Condition (K) on f implies that (i) a satisfies (K) (use (3.113)) and (ii) f(x + y) = 
f(y +x) for x, y € G, which also follows from taking conjugates on both sides of 
(3.113), 


(a(x), a(y)) = (a(x), a(y)) = (a(y), a(x)), 
and use (3.113) again. 
Setting y = zo in (3.113) and using a(zo) = 0, we have 
f(x +20) + f@ — 20) =0 
(3.114) or f(x+2z0)+ f@) =0 
or f(x)+ f(x —2z0) =0 forallx €G. 


From (3.113), (3.114), and (K), for x, y € G we obtain 
(a(x),a(y)) + (aa + 20), @(y + Z0)) 
= SG +y) + FO —y) + Fle +20 + y +20) + fe + 20 — 20-9) 
(3.115) 
= SUE + +O —)- FOF FSO fey), 
Define g: G > C, 8: G—> H by 


(3.116) g(x) = f(x) +if@+z0) forx eG, 
(3.116a) B(x) =a(x)+ia(x+z0) forx € G. 
Note that g(x + y) = g(yv +x) and (x+y) = f(y +x) forx, y € G, and g and £ 


satisfy (K). 
Using (3.115) and (3.114), for x, y € G, 


g@—y)=f@—y)+if@—ytz0) 
= (a(x), a(y) =) + (a(x + 20), a(y + Z0)) 
+ i(a(x +20), a(y)) + (a + 2z0), a(y + Z0)) 
= (a(x) +ia(x + 20), a(y) +ialy + z0)) 
(3.116) = (B(x), B(y)), 


which is the same as (3.109d). By Result 3.70, there is a continuous, unitary repre- 
sentation U of G in Ho = c€ span{f(G)} with cyclic vector so = £(0) = a(0) such 
that 


(3.116b) B(x) = U(x)s0, 
(3.116c) g(x) = (U(x)s0, 50), forx € G. 
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In view of (3.113), (3.114), and the (K) condition on f, 
2(a(x + 20) + a(x — 20), a(y)) = Ff +z0+y) + Ff +20-y) 
+ f(x —zZo+y)+ f(* —z0—y) 
= f@+yt+z0)+f@ +y —20) + f( —y +20) 
+ f(« —y — 20) 
=0 forallx,y EG. 
Thus 
(3.117) a(x +z0) ta(x —zo) =0  forallx EG. 
Now, turning to the definition of £ in (3.116a), using (3.117) we see that 
B(x) + B(-x) = a(x) +a(—x) = 2a(x) forx € G, 


span{a(G)} = span{f(G)}, and so so € cé span{a(G)}. Now, (3.113b) follows from 
(3.116b), and (3.113c) follows from (3.113a) and (3.113b). This completes the proof 
of this theorem. 


Finally in this chapter, we can deduce from Theorem 3.71 the following result. 


Result 3.72. [295]. Let G be an Abelian topological group and let f,aj; : G > 
C GU = 1,...,n) be continuous functions. Moreover, suppose that there exists a 
zo € G such that 

aj(zo)=0 for j =1,...,n. 
If the functions f and aj; (j = 1,...,n) satisfy the equation 


n 
(3.118) fe+ty)+ f@-—y)=2> aj@)ajO), x, yEG, 
j=l 
then there exist continuous characters X|,..., Xm of the group G (m <n), complex 
numbers bi, ..., bn, and a matrix B; ; € Minn(C) such that 
n 
Ais =on: Le liieagtit 


j=l 
m 
f(x) = >o [bi)?RerXj(x) x € G: 
i=l 
m 
t= > Fe Re): FEC, FH liana 


i=1 


3.5 Survey—Summary of Stetkaer 


We conclude this chapter with extracts from the beautiful, fascinating article (from 
a talk at the Functional Equation meeting held in Opava in June 2004 (ISFE42)) by 
Stetkaer [750]. 


210 3 Trigonometric Functional Equations 
3.5.1 Abstract 


I will describe the solutions f : G — C of classical functional equations such as 
those of d’Alembert, Wilson, and Jensen and the quadratic functional equation on a 
group G. Most of the previous studies assume that G is Abelian or at least that the 
solution f satisfies Kannappan’s condition 


(K) f(xyz) = f(xzy) forallx,y,z€G. 


The emphasis will be on what can be said about the solutions if G belongs to certain 
classes of non-Abelian groups such as the nilpotent groups and the semisimple Lie 
groups and Kannappan’s condition is no longer enforced. 

Throughout this part, let G be a group with neutral element e. For x,y € 
G, [x,y]= wey! denotes the commutator of x and y and G’ = [G, G] denotes 
the commutator subgroup of G generated by all commutators (see Remark 3.22). 

Note that G’ is a normal subgroup of G and that G/G’ is Abelian. 

Of course, if G is Abelian, then G’ = {e}. At the other extreme, if G is a semi- 
simple connected Lie group like SL(2, R), then G’ = G. 

Given a subgroup H of G, we consider the coset space G/H = {xH | x € G}. 
Let X be any set. A function f : G — X is a function on G/H if it is constant 
on each coset xH; ie., if f(xk) = f(x) for all x € Gandk € H. We will not 
distinguish between the function f on G and the corresponding function on G/H. 


Definition. Let f : G — X, where X is a set. We say that f satisfies Kannappan’s 
condition if it is a function on G/G’, or equivalently if (K) holds. 


If G is Abelian, then any function on it satisfies Kannappan’s condition. 

If f is a function on G, we let f* denote the function f*(x) = f(x7!), x €G. 

For any topological space X, we let C(X), Cp(X) and C,(X). denote the algebra 
of complex-valued continuous functions on X, the subalgebra of bounded functions, 
and the subalgebra of compactly supported functions. 

G is a locally compact Hausdorff topological group with identity element e. The 
group operation will be written multiplicatively unless the group is Abelian; in that 
case, we use +. C(G) denotes the algebra of all continuous complex-valued func- 
tions on G equipped with the topology of uniform convergence on compacta. 


3.5.2 The Abelian Solution and Related Extensions of It 


Due to the fact that the classical functional equations for many years have been 
studied intensively on Abelian groups, it is interesting that something new has 
emerged in the last five years. Sinopoulos [775] considered the following versions of 
d’ Alembert’s equation, Jensen’s equation, and the quadratic equation, respectively: 


g(x +y)+ ee +ay) = 2g()a(y), x,yeS, 
g(x +y)+g@+t+oay) = 2g(x), x,yes, 
g(x +y)+g@toy) =2g(x)+2g(y), x,y ES, 
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where the underlying space S is a commutative semigroup (not necessarily a group) 
and o is an endomorphism of S such that o (ox) = x forall x € S (see also Sinopou- 
los [745]). The range of g is respectively a quadratically closed (commutative) field 
of characteristic different from 2, a 2-cancellative Abelian group, and an Abelian 
group uniquely divisible by 2. The solution formulas of Stetkaer [775] are the same 
as in Sinopoulos [743], so the new feature is that the solution formulas extend from 
groups to semigroups and from C to more general structures. 
Many functional equations, such as d’ Alembert’s functional equation 


© fa+y)+fa—-y)=2f@)fO), x,y ER, 


have been extensively studied on Abelian groups such as G = R. Here f : R-> C 
is a function that we want to know more about. 

To set the stage for the further discussion, let me mention how the situation is on 
the real line: The continuous nonzero solutions of (C) are the functions of the form 


iAx —iAx 
fi (x) = — XE R, 
2 
where / € C; 1.e., the complex cosine functions (see Theorem 3.9). 
The functional equation (C) begs to be generalized to any group G by the pre- 


scription 


(CE) f(xy) + fy!) =2f@)f0), xy EG, 


where f : G > Cisa function that we want to know more about. We use the name 
d’Alembert’s functional equation for the functional equation (CE). 

We shall first consider the case where G is Abelian. If G is Abelian or just if 
f satisfies Kannappan’s condition (K), then Kannappan in his seminal paper [424] 
from 1968 found that the nonzero solutions of (CE) are the functions of the form 


* 
(3.37) eae 

: 2 
where y ranges over the homomorphisms y : G —> C*. In the example of the 
continuous solutions on G = R, we have y (x) = e!** for x € R. The character y is 
essentially unique: If y; : G + C* is ahomomorphism and 


PF _ 


ty 2 2 ’ 


then either yj = y or y1 = y* (see Lemma 3.16). 

Stetkaer [775, 770] has studied vector- and matrix-valued solutions of Wilson’s 
functional equation on Abelian groups G such that G = 2G (= 2-divisible groups). 
Let F denote an algebraically closed field of characteristic 0. He determines in [775] 
the general solution of 


Fixt+ty)+F—y)=a(y)FQ@), x,y eG, 
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where a is a3 x3 matrix-valued function and F a vector-valued function with linearly 
independent components, both with entries from F. Using that, he can write all 
solutions f : G — F of the scalar equation 


f@t+y)+f@—y) = gi@)ri) + g2@)ho) + a3@)A3(y), x,y EG, 


on a long list. 
In [770] Stetkaer determines the general solution of f : G — F of 


2 
f@+ty)-f@-y) => sii), x, yEG. 


i=1 


The new feature is the minus on the left-hand side. 

Let G be a locally compact, Hausdorff topological group. We let K be a compact 
Hausdorff topological transformation group of G acting by automorphisms on G. 
We let k - x denote the action by k € K on x € G. The normalized Haar measure on 
K is denoted by dk. An important special case is K = Z. = {+1}, where 


| f (k)dk = JD), 
K 2 


The most recent result is by Shin’ ya [734, Corollary 3.12], but important earlier 
works are due to Kannappan [424] and Chojnacki [156]. 


Theorem. (Kannappan [424], Czerwik [186], Sinopoulos [744]). Let G be Abelian. 
If @ € C(G) is a nonzero solution of 


f d(x +k-y)dk=(x)6(y) forallx,y €G, 
K 


then there exists a continuous homomorphism y : G — C* such that 
(x) =) y(k-x)dk forallx €G. 
K 


If & is bounded, then x may be taken as a unitary character (i.e., a continuous 
homomorphism of G into the unit circle). 


Let G now be a locally compact, Hausdorff topological group that is not neces- 
sarily Abelian, and let K be acompact subgroup of G. In classical harmonic analysis, 
you encounter not just functions on G/K but even functions f : G — C that are 
K-bi-invariant; i.e., such that f(kjxk2) = f(x) for all kj,k2 € K andx € G, or 
equivalently are functions on K\G/K. We say that (G, K) is a Gelfand pair if the 
convolution algebra C,.(K \G/K) is commutative. This is a weaker requirement than 
G being Abelian. 

We note that the continuous (algebra) homomorphisms ® of C.(G) onto C are 
the mappings of the form 
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a: fre | Feoyplods, 


where ¢ : G — C ranges over the continuous (nonunitary) characters of G (i.e., over 
the continuous functions ¢ 4 0) such that ¢ satisfies (M). So this is an equivalent 
characterization of the continuous characters. 

Let K be a compact subgroup of G. The continuous (algebra) homomorphisms 
® : C.(K\G/K) — C are the mappings of the form 


Oo: fr [ F(x)bn)dx, 


where ¢ : G — C ranges over the spherical functions (i.e., over the continuous 
functions ¢ 4 0) such that 


i: d(xky)dk = $(x)d(y)  forallx,y EG. 
K 


Here dk denotes the normalized Haar measure on the compact group K. So char- 
acters must be replaced by the more general notion of spherical functions, and the 
underlying space will be K\G/K rather than G. 

This philosophy has been taken up by M. Akkouchi and E. Elqorachi and other 
Moroccan mathematicians. Kannappan’s condition (K), which replaces the condition 
about G being Abelian, is in this setting substituted by 


(K’) | | townoatan = ff pokxneyatar for all x,y,z € G. 
KJK KJK 


As a sample, we mention two (corollaries of) results from [252, Theorem 2.1 and 
Theorem 2.2] that are in complete analogy to Kannappan’s earlier result. It should 
be mentioned that the results of [252] are much more general. 


Theorem. (Elqorachi and Akkouchi [257]). Let g € Cp(G) be a nonzero solution of 
the generalized d’Alembert functional equation 


(3.119) | e(rkyydk-+ [ g(xky—!)dk =2g(x)g(y), forallx,y €G, 
K K 


satisfying the generalized Kannappan condition (K’). 


(1) Then there exists 6 € Cp(G) satisfying (3.119) such that 
e+e 
£-=—y 


(ii) If 


where , $1 € Cy(G) satisfy (3.119), then either d) = ¢ or d) = ¢*. 
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Proof. The proof goes along the same lines as Kannappan’s original proof from 
[424]. 


Kannappan’s condition is clearly satisfied if G is Abelian. A similar result holds 
in the present framework. 


Result. [257]. Let (G, K) be a Gelfand pair. Then any solution g € Cp(G) of the 
generalized d’Alembert functional equation 


| g(xky)dk + | (Gk dk 2eGeG). jorailes ee, 
K K 


satisfies the generalized Kannappan condition (K’). 


In a recent paper [130], B. Bouikhalene and E. Elqorachi examine the functional 
equation 


(3.120) | | Fork yyZ(WOdkdult) = g@hy), x,y €G, 
GJK 
where yw is a Gelfand measure on G, dk denotes the normalized Haar measure of 


K, and y is a character of K, and where f € Cp(G) satisfies the Kannappan type 
condition 


(K”) | J Fesxty\du(dn@® = - i Faesyte\duls\du) 


for all x, y, z € G. The equation encompasses as special cases the functional equa- 
tions 


| | flctk-y)dkdu(t) = fO@)FO), xy €G, 
GJK 
I fOr. age 


[ (Cer OU—EiG). 29ee 


| Flety)du() + | Flxto(y))du(@) =2e(x)h), x,y €G, 
G G 


where o is a continuous involution of G. 

A new feature of [130] is that K acts as a morphism on G, not necessarily as an 
automorphism in equation (3.120). 

In [769], Stetkaer considers 


N 
Gif Fek-yWDak= Yee), xy eG, 
me é=1 
in which f, g1,...,gN,/41,...,4N € C(G) are unknown functions. Here y : K > 
{z € C | |z| = 1} is acontinuous homomorphism of the group K into the unit circle. 
SoG, K, x, and WN are given in advance. 
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Stetkaer [769, Theorem 4.8] shows that each solution of (3.121) can be expressed 
in terms of an N x N matrix-valued K-spherical function (i.e., a continuous map 
® € C(G, M(N, C))) such that 


fl O(x(k- y))dk = ®(x)®(y),_ forallx,y € G, and O(e) = J. 
K 


The paper studies the matrix-valued K -spherical functions associated with the solutions 
of equation (3.121) and applies this knowledge to the solutions of various functional 
equations of the form (3.121). 


3.5.3 d’Alembert’s Functional Equation 


As mentioned above, d’ Alembert’s functional equation on a group G is the functional 
equation (CE), where g : G — C is a function that we want to know more about. 
From Kannappan’s paper [424], we know its solutions if G is Abelian or if just g 
satisfies Kannappan’s condition (K). If G is not Abelian, the solution set is not 
known in general. 

If y : G > C* is a character, then the function g = (y + y*)/2 is a nonzero 
solution of d’Alembert’s functional equation on G. We will call solutions of this 
special form and the zero solution classical solutions. The constant function g = | 
is an example of a classical solution on any group, and it is on some groups the only 
classical solution apart from zero; for example, on simple groups and connected 
semisimple Lie groups because G = [G, G] for such groups. 


Example 1. The function g(x) = Atr(x), x € SL(2, C) (Le., the normalized trace) 
is a solution of d’ Alembert’s functional equation on the group G = SL(2, C), as an 
elementary computation reveals. Apart from the (trivial) classical solution g(x) = 
1 that exists on any group, I know of no other nonzero solutions of d’Alembert’s 
functional equation on SL(2, C). 


Example 2. Ng [40, Remark 5] gives another example of a nonclassical solution 
g of d’Alembert’s functional equation. The group is the group of unit quaternions 
{a+bi+cj+dk | a*+b*+c?+d? = 1}, and the functionis g(a+bi+cj+dk) =a. 


Actually, the two examples are related. Viewing the group of unit quaternions 
as SU -, Cc SL(2, C), it turns out that g is the restriction of the normalized trace 
function 4 zt from Example | to SU(2). Restricting still further to the quaternion 
group Os = {+1, +i, +j, +k}, we get Corovei’s [173] example of a nonclassical 
solution of d’ Alembert’s functional equation. It is the only such one on Qs. 

The groups SL(2, C) and SU (2) in Examples | and 2 above are connected semi- 
simple Lie groups, so they are far from being Abelian. Indeed, [G,G] = G, so 
the commutator subgroup is as large as it can be. For nilpotent groups, the situation 
is the converse because the commutators by definition become successively smaller 
and after a finite number of steps reduce to {e}. An example is the Heisenberg group 
G = H;(R), defined by 
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1x z 
A3(R) = y(%, y,z)= (Oly) |x,y,ze RTF, 
001 


where [G,G] = Z(G) (the centre of G), so [G,[G, G]] = {e}. The Heisenberg 
group is an example of a connected nilpotent Lie group (see examples in the section 
“Solutions of (CE) on a non-Abelian group” not of the form (3.37)). 


Result. (de Place Friis [292]). Let G be a connected nilpotent Lie group and K a 
quadratically closed field with characteristic different from 2. Any nonzero solution 
g :G — K of d’Alembert’s functional equation (CE) or even of the more general 
functional equation 


(3.122) —-g(xy) + gx) + gry!) + g(y |x) = 4e¢@)g0), x.y €G, 
is classical; i.e., there exists ahomomorphism y : G — C* such that (3.37) holds. 


If G is not a connected nilpotent Lie group, then new phenomena occur, even for 
step 2 nilpotent groups (i.e., for groups G such that [G,[G, G]] = {e}). We have 
already observed this for Qg = {+1, +i, +7, +k} (the quaternion group). Here go, 
defined by go(+1) = +1, go(+i) = go(4j) = go(tk) = 0, is a nonclassical 
solution of d’ Alembert’s functional equation (actually the only one). 


Proposition. [771,266]. On any step 2 nilpotent group, the solutions of d’Alembert’s 
long functional equation (3.122) are the same as the solutions of d’Alembert’s func- 
tional equation. 


3.5.4 Wilson’s Functional Equation 
By Wilson’s functional equation on the group G we mean the functional equation 


(3.27) fay) + fry ') =2f@)giy), x yEG 


(see Theorems 3.11 and 3.41), where f, g : G + Care to be determined. For g = f, 
Wilson’s functional equation becomes (C), but there are more relations between the 
two functional equations. 


Theorem. [292]. Let G be a connected nilpotent Lie group. Let the pair f,g : G > 
C be a solution of Wilson’s functional equation such that f 4 0. Assume finally that 
g is not identically 1. Then there exist ahomomorphism y : G — C* and constants 
a,b € C such that 

as ae ae tay 


b = 
+ g 5) 


= 5 


Conversely, if f and g have this form, then the pair f, g is a solution of Wilson’s 
functional equation. 
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So the formulas for the solutions are the same as on an Abelian group except 
possibly for the case of g = 1. The special case of g = 1 reduces Wilson’s functional 
equation to Jensen’s functional equation. 


Result. [776]. Let G be a step 2 nilpotent group. Let the pair f,g : G—> C 
be a solution of Wilson’s functional equation (3.27), in which g has the form g = 
(y + y*)/2 for some homomorphism y : G — C*. Then: 


(a) Ify # y*, then f has the form 


fee ge 
ey ee ae ee 
f=a 5) + 5) 
for some constants a,b € C. 
(b) If y = y*, then f has the form f = Fy, where F : G > Cis a solution 
of Jensen’s functional equation on G, i.e., F (xy) + F(xy7!) = 2F (x) for all 
x,yeEG. 


If y = y*, then y = 1 on groups that are generated by their squares, which is 
the case for connected Lie groups. 

If g is nonclassical, we obtain the following result, in which gg denotes the 
solution of d’Alembert’s functional equation on the quaternion group Qg given by 
go(+1) = +1, go(ti) = go(t+J) = go(tk) = 0. 


3.5.5 A Variant of Wilson’s Functional Equation 
In this section, we discuss the functional equation 


(3.123) f(xy) + f(y 'x) =2f@eiy), x,y EG, 


where f, g : G — C are functions on G for which we want information. The func- 
tional equation (3.123) is very similar in appearance to Wilson’s functional equation 
(3.27) but differs from it in the second term, where the new equation has f (y1x), 
while Wilson’s equation has f(xy~'). The two versions of course agree if G is 
Abelian. 

The Heisenberg group with integer entries shows that equation (3.123) and 
Wilson’s equation (3.27) may have different solutions (see [768, Example IX.2]). 
Actually, neither solution set is contained in the other. 


Result. [768]. Let the pair (f, g) : G > C bea solution of equation (3.123) such 
that f #0. Definemg : G > C bymg(x) = 2g (x)? — g(x), x € G. Then 


(a) f(xyx7!) = mg(x) f(y) for all x, y € G. 
(b) mg is a homomorphism of G into the multiplicative group {+1}. 


The homomorphism mg is in many instances identically 1. Then mg = | if one 
of the following conditions holds: 


(c) G is Abelian. 
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(d) G is generated by its squares; i.e., G = (squares). This is the case if G is a 
connected Lie group. 

(e) Gis a finite group of odd order. 

(f) G is a connected topological group and g is continuous. 


Then either 


(1) {f, g} = {cg, g}, where c € C* and g is a nonzero solution of d’Alembert’s 
functional equation that does not satisfy Kannappan’s condition, or 

(2) there exists a homomorphism y : G — C* such that g = (y + x*)/2. This fixes 
the homomorphism y except for the interchange of y and y*. 


(i) Ifx #x*, then 
f=a 


for some constants a,b € C. 
(ii) If y = y*, then f = (a + A)y, where a € Cand A: G > Cis additive. 


, ae 4 
5 


Conversely, any pair {f, g} of functions of the forms described above satisfies 
(3.123). 


So we get the complete solution of the variant (3.123) of Wilson’s functional 
equation if only we can solve d’ Alembert’s functional equation on G. 


3.5.6 Other Equations 


The trigonometric addition formulas 


(3.73) fay) =f@)geO) + fO)s@), x,y EG, 
(3.72a) g(xy) = gg) + f@)fO), x.y EG, 


were solved in [162] during the discussion of the cosine-sine functional equation 


(3.86) fry) = f@)gO)+ fO)g®)+h@)hty), x,y €G. 


Poulsen and Stetkaer [673] continued these investigations by finding explicit formu- 
las for the solutions of the following versions of the addition and subtraction formulas 
for sine and cosine: 


fao(y)) = f@)gd) -s@)fO), x yEG, 
fao(y)) =f@gO)+e@)fO), x yYEG, 
g(xo(y)) = g@)gQ) + f@)fO), x, yEG, 


where o : G > G is ahomomorphism such that o oo = Ig, where Jg denotes the 
identity map. The classical addition and subtraction formulas on an Abelian group 
G correspond to o = 1g. 

We end this section with an example (application) where the cosine equation 
arises naturally. 
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Derivation of (C) 


Let 2y be the angle between two unit vectors, and denote by 2 f (y) the length of their 
resultant. Let us consider two such pairs pj, p2 and qi, g2 with resultants p and q. 
Let 2x be the angle between them (see the figure below). Then 


lpPl=|pit pal =l¢l=lait+ al =2f0), 
lyl=|p+q@l=lpit potgi+42| =2f@)2f(). 


(By the proportionality of the length, 2 f(x) is multiplied by the common length 
2f(y).) 


pt+q=y 


But 


ly] =|pit po+qit+a@l 
=|pitgt p2+q2\ 
=2f(xty)+2f@—y). 


Thus (C) holds. 


4 


Quadratic Functional Equations 


Quadratic functional equations, bilinear forms equivalent to the quadratic equation, 
and some generalizations are treated in this chapter. 

Among the normed linear spaces (n.].s.), inner product spaces (i.p.s.) play an 
important role. The interesting question when an n.1.s. is an i.p.s. led to several char- 
acterizations of i.p.s. starting with Fréchet [291], Jordan and von Neumann [398], 
etc. Functional equations are instrumental in many characterizations. One of the 
objectives of the next chapter is to bring out the involvement of functional equations 
in various characterizations of i.p.s. The basic algebraic (norm) condition that makes 
the n.l.s. an i.p.s. is the parallelogram identity, also known as the Jordan—von Neu- 
mann identity (or the Appolonius law or norm equation), 


(JvN) lx + yll? + llx — yl? = 2[x1? +2Ilyl? forx,y € V, 


where V is an n.l.s. This translates into a functional equation well known as the 
quadratic functional equation, 


(Q) q(x + y)+ qe — y) = 2q(x) + 2q(y) forx,y eV, 


where V is a linear space. 

This chapter is devoted to the study of the quadratic functional equation (Q) and 
its generalizations, which form a basis for the next chapter, on i.p.s. We will show in 
Chapter 5 how (Q) is used in various characterizations of i.p.s. 

Recall that a mapping B : G x G > F (Ga group and F = R or C) is called 
biadditive if B is additive in each variable; that is, if B(-, y) and B(x, -) are additive 


in (-). 


4.1 General Solution and Properties 


Our first result is the following theorem giving the general solution of the quadratic 
functional equation (Q). 
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Theorem 4.1. The general solution g : G > F (Ga group and F = RorC) 
satisfying (Q) and the condition 


(K) qatyt+z=qat+z+y), forallx,y,z7€G, 
is given by 
(4.1) q(x) = B(x, x) forx eG, 


where B: Gx G — F is asymmetric, biadditive function. This function B is unique 
and is given by 


(4.2) B(x, y)= -lqa@+y)—-qa—y)] 


Re Ble 


= zlaG +y)—q(x)—-qQy)] forx,y eG. 


Proof. It is easy to verify that (4.1) with symmetric B and (K) satisfies (Q). 
Conversely, let g with (K) satisfy (Q), and we define B : G x G > F by (4.2). 
Note that g(x +y)=q(y+x), B(O, y) = B(x, 0) = 0 = g(0), and g(2x) = 4q(x) 
since q is even; that is, g(x) = q(x). 
From (4.2), (K), and (Q), we conclude that 


4B(x+y,2z) =qat+y+2z)-—qau+y-—2z) 
=q@+z7z+04+2))+40+z-0+2)) 
q@ =2=(9—2)) =@0 —24+ Y=) 
= 2q(% +z) + 2¢(y + z) — 2g —z) — 2g — Zz) 
(4.3) = 8B(x,z)+ 8B(y,z) forx,y,z EG. 


Now, y = 0 in (4.3) yields B(x, 2z) = 2B(x, z), so that (4.3) is 
Bx +y,z) = BQ,z)+ B(y,z) forx,y,z EG; 


that is, B is additive in the first variable. The symmetry of B implies the additivity 
of B in the second variable. 

The same conclusion can be obtained in relation to the Jensen equation (J) as 
follows (see Aczél, Chung, and Ng [40] and Aczél and Dhombres [44]). Use (K), 
(Q), and (4.2) to get 


2B(x + y,z) + 2B —y,z)=qaty+z)+q@—ytz) 
=q@ + ¥)—¢@@ = y) = 2q{z) 
= 2q(x +z) + 2q(y) — 2g (x) — 24(y) — 24 (z) 
=4B(x,z) forx,y,zEG, 


which for fixed z is Jensen. Since B(0,z) = 0, we conclude from Theorem 1.50 
that B(x, z) is additive in the first variable. The symmetry of B gives the biadditivity 
of B. 
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Finally, from (4.2) and (Q), we have 
2B(x,x) = q(2x) — 2q(x) = 4q(x) — 2q(x) = 2q(x), 


which is (4.1). 

To prove the uniqueness of B, suppose there exists another symmetric, biadditive 
Bi : Gx G = F such that g(x) = By, (x,x) forx € G. If we define Bz(x, y) = 
B(x, y) — Bi(x, y) for x, y € G, then Bz is also a symmetric, biadditive function 
with Bo(x, x) = Oforallx € G. Hence Bo(x+y,x+y) = 0 forx, y € G, implying 
that Bo(x, y) = 0; that is, B} = B. Thus B is unique. 

This concludes the proof of the theorem. 


Remark 4.2. Note that F = R or C may be replaced by any Abelian group divisible 
by 2 in Theorem 4.1. 


Remark 4.3. The condition (K) is equivalent to 
(4.4) qa t+y+2z)=qa4+z+yt+z) forx,y,zEG. 


Condition (K) implying (4.4) is obvious. Suppose (4.4) holds. Replace y by y — z in 
(4.4) to get (K) (also see Kurepa [588]]). 


Proposition 4.4. Letg : G > F satisfy (Q) and (K), where G is a group and F = R 
or C. Then, B given by (4.2) satisfies 


(i) Baxt+ty—-x — y,z) =Oforx, y,z € G, 
Gi) Baa + y,z) = Bey + x,z) forx,y,z€G, 
(ii) B(x + y —x,z) = BY, 2z) forx,y,z €G, 


and q satisfies 
(4.5) q@+yt+z)=qa+y)—4a@) +9042) -4%)+9@4+%)-40) 
for x,y,z € G. As a matter of fact, (i), (ii), and (iii) are equivalent to each other. 


Proof. By (4.2) and (K), and noting that g(—x) = q(x) (q even), 


1 
Ba ty—x—y,2)= Fla ty—x—ytz)-—q@ty—x—y—2)] 
1 
= qla@t+Cx-yt2+y)—-q@t+Ca-y-zt+y)] 
1 
a glaytz+y—a-y-z+y)] 
= 0, 


which is (i) (which can also use the additivity of B in the first variable). 
By using the additivity in the first variable, we get 


O= Baxty-x—y,z)=Bx+y,z)+ B(-x —y,z), 
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which is (ii). Finally, from (ii), we obtain 
B(x + y,2) = Bly + x,z) = B(y,z) + BG, 2); 


that is, 
Bix + y,z)— BQ, z) = BY, 2), 


which is (iii). 
Since B is biadditive, 


Bix + y,z) = B(x, z) + BY, 2z); 


that is, by using (4.2), we have 
1 
zla@ ty +2)—4g@ +y)— 9B) 


1 1 
a la +2z)—q(x)— q(z)] + z140 +2z)-—4(y)— ¢(z)], 


which is (4.5) (see also Kannappan [464]). 


Remark 4.5. Let G’ be the commutator subgroup of G. Then G = G/G' is a com- 
mutative group. If g : G — F satisfies (Q) and (K), then g = g 0 j, where ; is the 
canonical homomorphism of G into G and q: G = C satisfies (Q). 

To see this, for j : G > G defined by j(x) = x + G’ for x € G, define 


q:G=+C 
by 
qa@4+G)=q(x) forx eG. 


This ¢ is well defined. Note that g(x) = q(x + u) for any commutator u because 
of (K). Ifx + G’ = y+ G’, then x — y € G’ orx — y = u, acommutator. Then 
q(x) = q(y + u) = q(Q). Now, 
qa@t+G’+y+G)+qa4+G-(Vv+G))=qat+y+G)+qa-y+G) 

=qa+y)+q@—y) 

= 2q(x) + 2q(y) 

= 29q(x + G’) + 29(v + G’). 
Result 4.6. Let f : G — F satisfy (K). Then H = {x € G: f(x) = f(0)} contains 


G' and —x +a+x €H forx € Ganda eH. 
Now, 


faty-x-y=fat(xr-yt+y=fO>G CH. 
Forx €G,aedH, 


f(-x*x+a+x)-f@=0>5>-x+a+x€H. 
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Proposition 4.7. Let V be a vector space over a field F (which can be R or C) of 
characteristic zero. Thenq : V — F satisfying (Q) has the properties that 


(4.6) q is even, q(0) =0, g(rx) =r’q(x), forx EV, reQ, 
and 


u— 


(4.7) q(u) + q(v) = 2q (=) +2q ( *) foru,veV. 


Proof. Changing y to —y in (Q), we get g(y) = q(—y); that is, g is an even function. 
Set y = 0 in (Q) to get g(0) = 0. With y = x in (Q), we have q(2x) = 27q(x) for 
x € V. By induction and evenness of q, it follows that 

g(nx) =n’q(x) forx eV, ne Z. 
Also, we have 


x 1 a 
a (=) =a), neZ*,xev. 
n n 


Hence, for any rational 7 and x € V, we obtain 


m 2 (x m\2 
1(Ss) =ma (2) = (Zoe 
n n n 
u—v 
2 
Remark 4.8. Equation (4.7) holds when the domain of q is a 2-divisible group G 


and q satisfies (K). 


Replacing x by ure and y by 


for u,v € V, we indeed obtain (4.7). 


4.2 General Solution on a Complex Linear Space—Sesquilinear 
Solution 


Remark 4.9. By Theorem 4.1, for the quadratic function g in Proposition 4.7, there 
exists a unique, symmetric, biadditive function B : V x V — F such that g(x) = 
B(x, x). In general, this B is not bilinear or sesquilinear. The question arises as to 
what condition to impose on q in order for B to be bilinear or sesquilinear. 


Definitions. Let V be a linear space over C. A mapping B : V x V — Cis said to 
be bilinear provided B is biadditive and linear in each component; that is, 


BOAx, y) =AB(x, y), Bix, Ay) =AB(x, y), forx,yeV, AEC. 


The mapping B is called sesquilinear, antilinear, or conjugate linear if B is biaddi- 
tive and B(x, y) = AB(x, y) and B(x, wy) = WB(x, y) forx, ye V, A,w eC. 


Evidently, if B in Remark 4.9 is sesquilinear, then 
(4.8) q(x) = |A/?B(x, x) = |Al?q(x) forx EV, 2 EC. 


This condition seems to be adequate, and the corresponding result for vector spaces 
over the reals is false. 
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Theorem 4.10. (Kurepa [584, 585], Vrbova [823]). Suppose q : V —> C satis- 
fies (Q) and (4.8), where V is a complex linear space. Then there exists a unique 
sesquilinear form B, : V x V — C such that 


. 
Bi(x,y) = glae+y)- y+ sla +iy) — q(x —iy)] 
(4.9) = B(x, y)—iBlix,y) forx,yeV, 


where B(x, y) = gla + y) — q(x — y)] is given by (4.2). 


Proof. Kurepa in [584, 585] proved this result, and Vrbova in [823] gave a simple 
proof. Also see [44]. Here we present a different simple, direct proof of the same. 
By Theorem 4.1, there is a symmetric, biadditive B : V x V — C given by (4.1) 
and (4.2). 
Use of (4.2) and (4.8) yields 


B(x, Ay) = APB, y), BUx,y) = [PB (x=), 


vi 
(4.10) x 
B(x, Ay) = |A|2B (5.9), for x,y € V and for 4 (£0) €C. 
Further, 
Bui st =B ? ? 
(4.11) naan da | 
Bix, y) = B(x, -iy) =—B(x, iy), forx,yeV. 


Now (4.9) and (4.11) give By (ix, iy) = By(x, y), which in turn yields 
(4.12) Bi(x,iy) = —B,(ix,y) forx,yeV. 
Note that B, is biadditive. From (4.9), (4.10), and (4.11), we have 
By (ax, Ay) = |Al’Bi@, y), 
Bi(ax, y) = lar’: (x,2), 
(4.13) By (x, Ay) = |Al?Br (5.¥) , forx,yeV, A(¢O EC, 


B(x, iy) = B(x,iy) —iBix,iy) = —B(ix, y) —iB(x, y) 
= —1(B(x, y) —iBlix, y)) = —iBi(x, y). 


Thus combining (4.13) with (4.12), we obtain 


(4.14) Bi(ix,y) =i Bix, y)=—B,(x,iy) forx,yeV. 
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Using (4.13) and (4.14) for x, y € V andt € R, we get 


Bi((¢ +i)x, y) = Bi(tx,y)+iBi@,y) forx,yeV,teR 
y 
=(14+127)B 
(i+?) i(.) 
t-1 
—(1+1°)B — 
d+ ) (x. =) 
+47) (|8 : iB : 
i l Ke ———S 


(2 1+7 i 
By (4. ’) + ——-B,((1+ #”)x, y) 


~ T+ (+?) 
: [2 (<.9) + Bitx,y)| 


I 2 
+ Tage ee) TE x, y)] 


~ 14h 


t2 


1+? 


1 
=F, phe) + 


Bi(tx, 
ee i(tx, y) 


i 2 ; 
=p Tape) + By(t°x, y)I|; 


that is, 
(1 +7) By (tx, yytd+ t)i Bi (x, y) = Bix, ty) + t? By (tx, y) 
+i Bi, y) +iBix,y), 
meaning 
(4.15) By(tx,y) + it” By (x, y) = B(x, ty) + iB\(t?x, y) forx,yEV,teER. 
Changing ¢ into —t in (4.15) and adding or subtracting the resultant, we have 
By (t?x, y) = t’ Bi (x, y), By(tx,y) = B(x, ty), forx,yeV, teER, 
from which we can conclude (since B, is biadditive) that 
(4.16) By(tx, y) =tBi(x,y)= Bi(@,ty) forx,yeV,teER. 


Thus B, is sesquilinear. Indeed, for 4 = t + is, t,s € R, from (4.14) and (4.16) it 
follows that 


By (Ax, y) = Bi((t + si)x, y) = By(tx, y) + Bi(six, y) 
=1tBy(x, y) + siBi(x, y) = ABi(x, y) 
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and 
By(x, Ay) = Bi (x, (t + si)y) 
= By (x,ty) + Bi(x, siy) 
S tBy (x,y) — siBy(x, y) 
= AB, y). 


This completes the proof. 


4.3 Regular Solutions 


We will consider regularity solutions of (Q). 


Result 4.11. (Refer to Kurepa [576].) Suppose q : R — R satisfies (Q). If q is 
continuous or continuous at a point, bounded on [0, d[ (d > 0), or bounded ona set 
of positive measure or measurable, then q has the form 


(4.17) q(x) = cx? forx ER, 

where c is a real constant. 

Remark 4.12. For continuous q, (4.17) follows from Proposition 4.7 since g(rx) = 
r°q(x) forx €R, re Q. 


Also, from Theorem 4.1, g(x) = B(x,x), where B is biadditive. Now, from 
(4.2), we see that B is continuous in each variable so that, for fixed y, 


B(x, y) =c(y)x = cyx 
(using additivity in the second variable), which leads to (4.17). 


Remark 4.13. A mapping g : R — C satisfying (Q) and continuous is of the form 
(4.17), where c is a complex constant. 


Result 4.14. [576]. Let V be a Banach space over R. A mapping qg : V > R 
satisfying (Q) and continuous at one point is continuous everywhere and is bounded 
on every bounded sphere. 


Theorem 4.15. (Baker [89]). Suppose q : C — Cis a solution of (Q). Then q is 
continuous or continuous at a point if and only if there exist complex constants a, b, 
and d such that 


(4.18) q(z) =az*+blz\?+dz forz EC. 


Proof. If ¢ is of the form (4.18), it is easy to check that g satisfies (Q). We now prove 
the converse. 

By Theorem 4.1, there exists a biadditive B : C x C — C such that g(z) = 
B(z, z), for z € C, and B is continuous in each variable. By Theorem 1.23, for fixed 
wecC, 

B(z, w) =cy(w)z+co(w)z forz eC. 
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Applying once again the additivity in the second variable, we have 
ci(w + 0)z + c2(w + 0)Z = ci (w)z + c2(w)Z + c1(0)z + c2(v0)z_ forz,w,o EC. 


By choosing z real and purely imaginary, we see that c) + cz and c; — c2 are additive 
on C and are continuous. Hence 


(c) +. c2)(w) = ayw + aw, (c1 — c2)(w) = byw + baw, 
where a1, a2, b1, b2 are complex constants. Thus 
(4.19) B(z,w) = d\zw + doz0 + d3Z7w+ dyzw forz,w €C, 


where dj; (i = | to 4) are complex constants. This proves the theorem. 


Remark 4.16. If ¢ in Theorem 4.15 is differentiable, then q(z) = cz, where c is a 
complex constant. 


Result 4.17. [576]. Let H be a Hilbert space over R. Then, for every continuous 
q: H —> R satisfying (Q), there exists a unique, symmetric, bounded, linear opera- 
tor T : H — H such that 


q(x) =(Tx,x) forx €H. 


Further, if q(x) = q(x) for every orthogonal operator T : H > H andx € H, 
then 
q(x) =ellx||? forx € H, 


where c is a real constant. 


Result 4.18. [89]. Let V be a complex vector space and q : V — C be a quadratic 
functional satisfying (Q) such that, forx € V, 2 €C, 


IqQax)| = |2°2G()I. 


Then, one of the following holds for allx EV, AEC: 


q(ax) = q(x); g(x) =AP?q@); g(x) = 2q(x). 


Result 4.19. [89]. (Decomposition of a quadratic functional.) Letqg :V—> C 
be a solution of (Q), where V is a complex linear space. For each x € V, the 
mapping 4 — q(Ax) (A € C) is continuous (that is, continuous along rays) if and 
only if there exist three unique quadratic functionals q\, q2,q3 : V — C such that 
q=U11Q27 93, 


qx) = Paix), Ux) = |AP qo), a3(x) = 22q3(x), 


forx EV, AEC. 
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4.4 Generalizations and Equivalent Forms of (Q) 


Now we consider some generalizations and equivalent forms of (Q). 


Theorem 4.20. Let f : G — C be a nonconstant solution of 


(4.20) faty+f@—-—y)=af)+bf) forx,y eG, 


where a and b are constants and G is an Abelian 2-divisible group. Then either f is 
Jensen (that is, it satisfies (J)) or satisfies (Q). 


Proof. Change y into —y in (4.20) to get bf (y) = bf(—y). If b = 0, then (4.20) 
gives a = 2 and is thus Jensen (put y = 0). 

If b £0, then f is even. Interchange x and y in (4.20) to obtain (a — b)(f (x) — 
f(y) = 9. 

Since f is nonconstant, a = b. Now x = O (or y = QO) in (4.20) gives 
(a — 2) f(y) = —af (0). Ifa 4 2, then f is constant, which is not the case. Thus f 
satisfies (Q). 


Theorem 4.21. Suppose f : C — C is a continuous solution of 
(4.21) f(+w)+ fz—D) =2f(z)+2f(w) forz,wec. 
Then f has the form 

(4.18) f@=aG+zZy+ee—-Z forzeC, 


where a, c are complex constants. 


Proof. First restrict f to R. Then f satisfies (Q) on R and is continuous. Then by 
Remark 4.12 we have f(x) = c)x for x € R, where c, is a complex constant. 
Set w = it, t € Rin (4.21) to get 


f@tit)=f()+fit) forzeC, teER. 


First, taking z = is, s € R, we get “additivity” of f, and the continuity yields 
f (it) = cot, where cz is complex. Thus, for z = s + it, we have 


f@=f(s tit) = fo)+ FG =c1s? + cat 
a(z+Z)*° +b(z —2), 


which is (4.18). 


Usually in statistics, in particular in estimation theory, a quadratic expression 
has to be maximized subject to some constraints. This can of course be done by 
calculus methods. However, if the arguments of the quadratic forms are them- 
selves matrices, this method becomes very cumbersome and requires a lot of indices. 
Another method, known as the quasi inner product method is more often used. It is 
used in statistical estimation and mathematical programming. Drygas [230] proved 
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a Jordan and von Neumann type characterization theorem for quasi-inner products. 
In this characterization of a quasi inner product, the functional equation 


font foy=re-42[ 0 (2) (2) 


2 2 


played an important role. By changing y to —y in the equation above and adding the 
resultant to it, we obtain 


(4.22) faty)+ f@a—y)=2f)+ fo)+ fCy), 


an equation similar to (Q). 


Theorem 4.22. Let G be a group and F be a field of characteristic different from 2. 
Let f : G > F satisfy (4.22) for x, y € G and the condition (K). Then there exist 
an additive A and a biadditive B : G x G = F such that 


(4.23) f(x) = BQ, x) + A(x) forx eG. 


Proof. Either x = 0 or y = 0 in (4.22) gives f(0) = 0. 
Interchange x and y in (4.22) and subtract the resultant equation from (4.22) to 
get 
Fax -—y)=F(x)+F(-y) forx,y €G, 
where F(x) = f(x) — f(—x) for x € G. Thus F is additive (change —y to y), say 
2A. Then f(—x) = f(x) — 2A(x) and (4.22) becomes 


fe t+y)+ fe —y) =2f() +2f() — 2A(y); 


that is, g(x) = f(x) — A(®) satisfies (Q). By Theorem 4.1, (4.23) follows. 
From Proposition 4.4, it is easy to see that if g : G — F satisfies (Q) and (K), 
then (4.5) holds. But the converse is not true. We have the following result. 


Theorem 4.23. [464]. The general solution g : G > F (Ga group and F a field of 
characteristic different from 2) of (4.5) with (K) is of the form 


(4.23) q(x) = B(x, x) + A(x), 

where A: G > F is additive and B : G x G > F ts biadditive. 

Proof. Set z = 0 in (4.5) to get g(0) = 0. Putting z = —x in (4.5), we have 
qy)+4)+90)+4(-*)=94a+y)+9q0-%), 


which is (4.22). The result now follows from Theorem 4.22. 
Now consider the functional equation 


ety =O)+7Ot+Ieyr I OHs-er 7) 
+ f(y +2) + f<+2), 
where f : G — F. The only solution of (4.24) is f = 0. 


(4.24) 
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To see this, let z = x to get 
(4.25) FO) + Fy) +2f@) = fx +y) + fy +x) + FO). 


Now y = 0 in (4.25) gives f(x) + f(0) = f(—x); that is, f(0) = 0 and f is even. 
Again (4.25) shows that f is additive. Since f is even, f = 0. 


But if we make a slight change in an argument in (4.24), the equation is equiva- 
lent to (Q). This leads to the following theorem. 


4.5 Equivalence to (Q) 


Theorem 4.24. Let G be a 2-divisible group and F be either R or C or a field of 
characteristic different from 2. Suppose f : G — F satisfies (K). Then the following 
are equivalent: 


a) f satisfies (Q), 

b) f@+y—2)4+ 7 @)+7@y)4+ f@) =f -e—y) 4+ fey+2) + f+), 

o) f@—y=-2)4+f/Q)+fey)+ fee) = fe —y+ 7042) 4 f—e+%), 

A) Fie yee) ea ae Pa ae) ee 2) 
=4f(x)+4f(y) +4f@), 


forallx,y,z€G. 


Proof. We prove the implications in the order stated. 
To prove that (a) = (b), since f satisfies (Q) and (K), from Proposition 4.7 and 
Remark 4.8 we have that f is even, f (5) = af (x), and f satisfies (4.7). 


lett side of 0) = 27 (2%) 426 (E22) 4 ar (=) 427 (2) 


= f+ +2[ar (=) +27 (4) 


= f(—x —y)+ f(-z+x)+ fCy+z), 


which is the right side of (b). 

To prove that (b) > (c), set z = 0 in (b) to get f(x + y) + f(0) = f(—x — y); 
that is, f (0) = 0 and f is even. Now change y to —y in (b) to get (c). 

To prove that (c) = (d), put x = O0in(c) tohave f(—y —z)+ f(0) = f(y +z), 
which implies that f (0) = 0 and f is even. 

Use that f is even and f satisfies (K) in the following calculations. First replace, 
z by x + z in (c) to get 


(4.26) fxtyt+2a=—-f-y+f+a+fE+x)+ fa) +f) - ff). 
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Second, change x and z to —x and —z in (c) to obtain 


f(-*-y+z2=fat+y)+fo-at+ f(-z+4) 
— fx) -— fo) — f@) 


for x, y,z € G. Lastly, replace x and y by —x and —y in (c) to have 


(4.27) 


fi-x«+y-z)= fa —y)+ fl-y+z)+ f(e+x) 
— f@)— fQ) — f@). 


Now adding (4.26), (c), (4.27), and (4.28), we have 


(4.28) 


(4.29) left side of (d) = f(x +y)+ f —y)+2(fQ +2) +f — z)) 
+2(f(+x)+ f@—x)) —2f@) —27) —47@). 


Changing x to y, y to z, and z to x in (4.29), we obtain 


en) left side of (d) = f(y +z) + fly —z) +2(f(e +x) + f(z —x)) 
+2(f@+y)+ f@ —y))—2f() —2f(@) -—4f(@). 


Equate the right sides of (4.29) and (4.30) to have 
(4.31) fO+FD+FO-D=fat+y)+ f&—-y)-—2f@) + 2F@), 
which permuting x, y, z yields 


f@+x)+f@—-x) = fO+z24+fO—-—z2Z)—-2f0) +2f@) 
(4.32) =faty)+fa-—y)-2f(y) +2f(). 


Using (4.31) and (4.32) in (4.30), we obtain 


(4.33) left side of (d) = 5(f(« + y) + f(@« — y)) —6f (x) — 6f (y) + 4f (2) 
=S(fv+a+f—2))-6f0) —6f(z) +4f(@). 


Fix y and define 
h(x) =S(f@ + y) + F@ — y)) — 6F (x) — OF (). 
Then (4.33) yields h(x) + 4f (z) = A(z) + 4f (x); that is, h(~) — 4f (x) = constant 
= t(y), say. Finally, (4.33) becomes 
left side of (d) = 4 f(x) +t(y) +4 f(z). 


The symmetry of the left side in x, y, z shows that f(y) is 4 f(y), which shows that 
(d) holds. 

To complete the cycle, we will now prove that (d) > (a). Let y = 0 = z in (d) to 
have f(—x) = f(x) +4f(0). Thus f(0) = 0 and f is even. Now put z = 0 in (d) 
to have (Q). This completes the proof of this theorem. 
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4.6 Generalizations 


First we consider a generalization of (d) in Theorem 4.24 and prove the following 
theorem. 


Theorem 4.25. Let G be a 2-divisible group and F be either R or C or a field of 
characteristic different from 2. Suppose f; : G — F (i = 1 to7) satisfy (K) and the 
functional equation 
hie + yay oe —y ~ 4) tie — Pe) 
+ fa(—x + y — 2) =4f5(x) + 4 fey) + 4f7@) 

for x,y,z € G. Then they are given by 

fix) = q(x) + (A2 + A3 + Ag)Qe) +41 

= B(x,x) + (A2 + A3 + Aa)(x) + b1, 

fo(x) = B(x, x) + (2A — Az — A3 + Aa)(x) + bo, 

f3(x) = Bx, x) + (Az — A3 — Aa)(x) + 53, 

fa(x) = B(x, x) + (2A1 — Az + A3 — Ag)(x) + ba, 

fs(x) = B(x, x) + A4(x) + ds, 

fo(x) = B(x, x) + A3(x) + be, 

fi(x) = B(x, x) + (A2 — A1)@) + b7, 


(4.34) 


(4.35) 


4 a 

with >” bi = 4 > bj, where A; is additive (i = 1 to 4), q satisfies (Q), and B : 
i=l j=5 

G x G > F is biadditive. 


Proof. Replacing x, y, and z in (4.34) by —x, —y, and —z, respectively, adding and 
subtracting the resultant from (4.34), and using (K), we get 
fetta ys Eee Ps) 
+ fg(—x +y — 2) =4f5(«) + 4 F650) +457 @), 
AGt+yt+oq+h@-y-O+hGs-y+2) 
+ fe(—x +y —2) =4f3@)+ 4/60) +477), 
for x,y,z € G, where f%, f° (i = 1 to 7) are the even and odd parts of f; ( = 1 
to 7). That is, Ff? and f? also satisfy (4.34). Note that de is even, f? is odd, and 
Ff?) =0. 
Finding f/ is relatively easy by reducing to the Pexiderized Jensen equation 
several times and reducing to (d) of Theorem 4.24. We do that first. 
Setting z = —y and z = y separately in (4.34e), we get 
(4.36) fi (x) + fy (x) + fg (x — 2y) + fg (—x + 2y) = 4 f§ @) +46 + 0) 


and 


(4.37) ff @+2y)+ @ —2y) + fe) + fF) = 495) + 406 + £70). 


(4.34e) 


(4.340) 
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Subtracting (4.36) from (4.37), we have 


ff — f5)@ + 2y) + Gy -— F@ — 2y) = OT + fa -— £3 -— LG) 


for x, y € G, which is a Pexiderized Jensen equation. So there exists an additive A, 
on G such that 


ff-fp=Aita, fy—fy =Ait+e, c1,c. are constants. 
Since ff (i = | to 4) are even and A, is odd, we have 
(4.38) fi-fR=a, fxy—- ff seo. 
With y = 0, (4.36) gives 
(4.39) fit+tft+fet+ ff =4f§ +03, 3 is a constant. 
Similarly, letting y = —x and y = x separately in (4.34e), we obtain 


4.40) fi-fra=a, KR-h=o, fith+h+h =4h +0, 


where c4, C5, and c¢ are constants. 
In (4.34e), x = 0 = z gives 


(4.41) Fit di + + ff =4fe +c. 


From (4.38) to (4.41), we see that fy to fy differ from ff by a constant, and 
substituting these values of fy to f7 in terms of f/ in (4.34e), we have 


ii@+yt+a+A@=y4+2) 4+ fiery +z) 
+ fi(xty—2=4ff0) 4+ 4/70) + 4f7@) +6, 


where c is a constant. 

With x = 0 = y = z in the above, we get c = —8f/'(0), and then it is easy to 
see that f/ — f/ (0) satisfies (d) of Theorem 4.24. Thus, we get from Theorem 4.24 
and Theorem 4.1 that 


(4.42) fi (x) =q(x) +b) = Bix, x) +b; forx eG 


4 7 
(i = 1 to 7), where q satisfies (Q), B is biadditive, and >* bj = 4 >? bj. 
i=l j= 
Now, to determine /,? is a little bit tedious. Again we reduce (4.340) to the Jensen 
equation several times. 


Letting z = —y and z = y separately in (4.340), we have 


(4.43) fP(%) + fy (&) — fy @ + 2y) — £2 @ — 2y) = 4f3@) +406 — £7)0); 
(4.44) fp +2y) + f2@ — 2y) — f9@) — 2 @) = 4F5 &) +406 + £70). 
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Subtracting (4.43) from (4.44), we have 
4 
(4.45) (FP + FRx + 2y) + 2 + AP) — 2y) = 8 FP (y) + (> it) (x). 
i=1 
Changing y to —y in (4.45) and adding the resultant equation to (4.45), we get 
4 4 4 
(> i) (x +2y)+ (> i) (x —2y)=2 (> i) (x), 
i=1 i=1 i=1 
which is Jensen. Since f(0) = 0, 
(4.46) FP+R +R +S =A 


where A is additive on G. 
First x = 0 in (4.45) gives 


GP - f+ - Fy) =8F70), 
and then x = 0 = y in (4.340) gives 
(ff — f+ 42 -— f)@) =4F7@), 
so that f?(2y) = 2 f?(y), and with (4.46) we obtain 
(4.47) 2h; + f9) =4f7 + At. 
Now (4.46) and (4.47) in (4.45) yields 
CFP + £3) + 2y) — FP + F9)@ -— 2y) = 207 + FEY), 
which is Jensen. Noting that (f? + f?)(0) = 0, we get 
(4.48) fo +fp=Ar, fPt+fR=Ai—A2, 4/7 =2A2.-Al, 


where A2 is additive on G. 
Letting y = x and y = —x separately in (4.340) and subtracting the latter from 
the former, we obtain 


(4.49) CFP + £2) 2x + 2) — G3 + FP) (2x — 2) = A1@) + 8f6 (x). 
Put z = 0 in (4.49) to get 


Cf? + 2 — f8 — R)Qx) =8f8(2) forx eG. 
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Letting z become —z in (4.49) and adding the resultant to (4.49), we have 


Gi + fe — fo — RICx +2) 
+P + fa — 3 — F2)VAx — 2) = 16f5) 
= 2(ff + fa — fy — f3) 2x), 


which is Jensen, so that, as before, we get 
(4.50) ff-h-h+h=A3, 4f6 =As, 


where A3 is additive on G. 
From (4.46) and (4.50), we have 


(4.51) 2(ff + fz) =Ai+ Az, 2(f9 + f3) = Ai — As. 
Letting y = 0 = z, (4.340) gives 
(4.52) FP+R-- Se HAPs. 


Finally, put z = 0 in (4.340). Then replace y by —y and add the resultant equation 
to obtain 


Gf t+ fe — fs — fey@+y) 
+(ff + fo — £3 — FA) — y) = 8f3(y) 
= 2(ff + f2 — £3 — f2)Q) (use (4.52)), 


which is Jensen, so that 
(4.53) fithi- -i =AeH=48 


where Az is additive. 
Use (4.50) and (4.53) to have 


2(f7 — f3) = Aagt As, 
2(f7 — fa) = A4 — As. 


These equations with (4.48) yield 


A2 ara Az A3+A4 

4.54 2 ge 
oe po = Bien fies > Al— Ar Ag—A3 
aD 4 ee 8 4° 


Now (4.48), (4.50), (4.53), and (4.54) give f? (i = 1 to 7). These together with 
(4.42) give the result sought, (4.35). This completes the proof of this theorem. 


Remark 4.26. Note that (4.34) determines seven unknown functions without any 
regularity assumptions on f;. 
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Result 4.27. [464]. Let V be a linear space over the field F of char = 0 (or different 
from 2). Suppose fi : V > F (i = 1 to7) satisfy the functional equation 


A@+y+z2+ fake) + fBO)+ fa@ = fsa +y)+ foyt+2)+ f7z@ +x). 


Then there exists biadditive B: V x V — F, additive A, Ai: V > F (i = 1 to 3) 
such that f;’s are given by 


Ai@)= B@,a) +AG) +64 G+ Cs, 

fox) = Bx, x) + AQ) — Aix) — Az) + C3 — Co, 
f3(x) = BQ, x) + A(x) — Aix) — Ao) — C2 — Cs, 
fax) = B(x, x) + A(x) — Ar(x) — Asx) + C1 — Ca, 
fs(x) = BQ, x) + A(x) — Ai@®) — C2 +03 +Cs, 

fo(x) = Bx, x) + A(x) — Az) + C1 -— C4 +Cs, 

fi(x) = Bx, x) + A(x) — A2(x) — As) + C1 + C3 — Co, 


where C;’s are arbitrary constants. 


4.7 More Generalizations 


We determine the general solutions of the functional equation 


(4.55) A@+ty+fh@a—y) = f@)+ fay) forx,y €G, 


where fj : G > F (i = 1 to 4), G is a 2-divisible group, and F is a commu- 
tative field of characteristic different from 2. The motivation came from a result 
due to Drygas [230]. Also, this equation is a generalization of the quadratic func- 
tional equation (Q). Special cases of this equation include the additive equation (A), 
the Jensen equation (J), the Pexider equation, and many more. Here we determine 
the general solution of (4.55) without any regularity assumptions on f;. Note that 
equation (4.55), like (4.34), determines several unknown functions, in this case four. 
We prove the following theorem. Even though it is solved in Ebanks, Kannappan, 
and Sahoo [241], here we present a different proof. 


Theorem 4.28. Let G be a 2-divisible group and F be a commutative field of char- 
acteristic different from 2. The general solution of (4.55) with f, and fz satisfying 
the condition (K) is given by 

fix) = BQ, x) — (Ai — A2)@) + b1, 

fa(x) = B(x, x) — (Ai + A2)Q@) + bo, 

f3(x) = 2B(x, x) — 2A1(%) +b, 

fa(x) = 2B(x, x) + 2A2(x) + ba, 


(4.56) 


with b, + bz = b3 + ba, where B: G x G > F is a symmetric biadditive function 
and Aj: G > F (i = 1, 2) are homomorphisms (that is, additive). 


4.7 More Generalizations 
Proof. Interchange x and y in (4.55) to get 
(4.57) AQ+x)+ fh —x) = AO)+ fa) forx,y €G. 
First subtract (4.57) from (4.55) and use (K) to have 
(4.58) fe(x—y)=a(x)—-a(y) forx,y EG, 


where f) (x) = fa(x) — fo(—x), a(x) = fa(x) — fal). fy is odd. 
Set y = 0 in (4.58) to obtain a(x) = fy (x) +1, so that (4.58) becomes 


fea -y=f7@)- £70): 
that is, hoy is additive, say 
(4.59) i = An 


Then a(x) = Ay(x) +c. 
Second, add (4.57) and (4.55) and use (K) to have 


(4.60) 2fitxty)+ fr—y)=f@)+fQ) forx,y €G, 


where fs (x) = f7(x) + fo(—x), BO) = fale) + far). 
Now y = 0 in (4.60) gives 


(4.61) 2fix) + fe) =Pha)+e, c= £80). 
Eliminating f; from (4.61) and (4.60), we have 

(4.62) —f5 +y)+ fe @ —y) +B +y) +02 = BO) + £0). 
Changing y to —y in (4.62), we obtain 

(4.63) —f3@—y)+ fea +y)+ BO —y)+c2 = BO) + By). 
Adding (4.63) and (4.62), we get 


B(x ty) + BO — y) + 2c2 = 2B) + BY)+B(-y) forx,yeG. 
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This equation is (4.22). Noting that f also satisfies the condition (K) from (4.60), 


from Theorem 4.22, f is given by 
(4.64) B(x) = B(x, x) + A(x) +c2 forx EG. 


From (4.59) and (4.64), we get 


(4.65) 


2 f3(x) = B(x, x) + (A+ A1)\@) +14 €2, 
2 fax) = B(x, x) + (A — A1)(®) — 1 +002. 
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Using £(x) given by (4.64) in (4.63), we have 
Fp @ +y)— fe @ — y) = 2BG, y), 
which with y = x gives 
(4.66) fy) = 5Blx,x) +03. 
From (4.59) and (4.66), we get 
(4.67) 2 fox) = 5BO.x) + AiG) tes. 
Now (4.61), (4.64), and (4.66) yield 
(4.68) 2filx) = 5B, x) + Ale) + 2c +63 


Equations (4.65), (4.67), and (4.68) give the solution sought, (4.56). This proves the 
theorem. 


4.8 Another Form of Quadratic Function 


Oresme in 1347 verbally described for the quadratic function a functional equation 
that can be written as 


s(m+1)t)—s(t) n+l 
ee?) s(nt)—s((n—1)t) 2n—1 


for all positive integers n and positive real t. The same equation with the same so- 
lution reappears almost three centuries later with Galileo (1638) in connection with 
falling bodies. In the following theorem, we obtain the continuous solution of (4.69). 


Theorem 4.29. Lets : Ry — Ry satisfy (4.69) fort € Ry andn € Zx be 
continuous. Then s(t) = cyt? +2 fort € R4, where c1, c2 are constants. 


Proof. If s is a solution of (4.69), so is s — s(O). Without loss of generality, take 
s(0) = 0. Set n = 1 in (4.69) to get 
s(2t) — s(t) =3s(t) or s(2t) = 2*s(t). 


Similarly, n = 2 gives s(3t) = 37s(t). So we will prove by induction that s(nt) = 
n’s(t) for positive n and t € R,. Suppose s(nt) = ns(t). Then, by induction, 
(2n — 1)s((n + 1)t) = Qn + 1)(s(nt) — s(n — 1)t)) + Qn — 1)s (nt) 
= (2n + 1)(n? — (n — 1)*)s(t) + Qn — 1)n’s(t) 
= (2n —1)(n+ 1)’s(t). 


4.9 Entire Functions and Quadratic Equations 241 


That is, 
s((n+1)t)=(n+1)’s(t) forne Z4, te Ry. 
Also ‘ 
t 
S (<) = s(t) forte Ry, ne Ry. 
n n 
Thus 
m 1 m\2 
s (=*) = = (mt) = (=) s(t) form,neée Z4, te Ry. 
n n n 
So 


s(rt)=r’s(t) forre QO, teER,. 
Invoking continuity of s, we have that 


s(xt)=xs(t) forx,t€ Ry. 


Setting t = 1, we have s(t) = ct?. This proves the theorem. 


4.9 Entire Functions and Quadratic Equations 


In this section, we will consider three functional equations studied by Haruki [348]: 


(4.70) IfG@+twy+1fC-— wv) =2AfOl+2fW)), zwec, 
471) |f@+w)P+1f@- wv)? =2f@OP+2fW)P, zw eC, 
where f : C > Cis an entire function, and (4.76). 

First we treat equation (4.70) and present two proofs. 


Theorem 4.30. [348]. Suppose f : C — C is an entire function. Then f is a 
solution of equation (4.70) if and only if f (z) = bz”, where b is a complex constant. 


Proof. For the first proof, we need the following two results about additive and biad- 
ditive functions. 
Suppose A : C — R satisfies (A) and is continuous. Then 


A(z) = A(x + iy) = ax + dy, 


where a and d are reals, z =x +iy, x,yER. 
If A: C > R, we can consider A also to be from C to C. Since A is continuous, 
by Theorem 1.23, 
A(z) =ajz+aoz forzeC, 


where a; and a2 are complex constants. But A is real, so ajz + a2Z = G1Z + 422Z; 
that is, (a, — @2)z = (@| — a2)z. Setting z = 1 andi, we get that a2 = a,. Thus 


A(z) = a12+ @Z = (41 + @)x +i(ai — @)y 
=ax+by, 


where a and b are reals. 
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Next we consider the biadditive function B: Cx C> R.LetB:CxC-R 
be biadditive and continuous in each variable. Then 


B(x +iy,u+iv) =ajxu+a2xv0 + a3yu+aqgyvo forx,y,u,v ER, 


where a; (i = | to 4) are real constants. 

By the part above (for fixed w and for fixed z), we get 

B(Z, w) = ai (w)x + a2(w)y 

(4.72) =b\(z)u+b2(z)v forz,w EC, 
z=xt+iy, w =utinv, ay(w), a2(w), b1(z), b2(z) are real functions, and 
xX,y,u,veR. 

y = Oin (4.72) gives a;(w)x = bi (x)u + b2(x)v. Then 

a(wt+w')x =bi(x)utu’)+b(x)\o+0'), wl =u’ +iv’ 
=a(w)x +a,(w’)x forw,w' €C, x ER. 


Thus, a;(w) is real, continuous, and additive on C. So, 
ai(w) = dju + dy, 
where d, d2 are real. Similarly, 
a2(w) = d3u + d4v, 


where d3,d4 are real. Substitution of these values of aj(w) and a2(w) in (4.72) 
yields the required form of B(z, w). 
For the first proof of (4.70), let g : C — R be defined by 


q(z) =|f(z)| forz eC. 


Since f satisfies (4.70), qg satisfies (Q) on C. Then, by Theorem 4.1, there is a biad- 
ditive B : C x C > R such that q(z) = B(z, z) forz € C. 
Use of the biadditive result above yields 


q(z) = ax? + 2hxy + by’, 


where z = x + iy anda, h, b are real constants. 
This conclusion can also be obtained from Theorem 4.15, noting that q is real. 
Now we have | f(z)| = q(z) = ax? + 2hxy + by? witha = b =h = Oor 
a> 0, ab—h? > 0. Ifa > 0, ab —b* > 0, then we get A log|f(z)| = 0 for 


; 2 2 ; 
0 < |z| < oc, where A stands for the Laplacian operator = + S. Thus we obtain 


a = bandh = 0; thatis, | f (z)| = a(x? + y2) = alz|2. This proves the theorem. 


For a second, alternate proof from H. Haruki [348], let MW be the absolute maxi- 
mum of | f (x)| in |z| = 1. Putting g(z) = | f (z)| — M|z|?, where g : C > R4, by 
(4.70) we have 


(Q) q(z+w)+q(z— w) = 2q(z) +2q(w). 
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Since g(z) < 0 in |z| = 1 and g(0) = O, there exists a complex constant zo with 
|zo| < 1 such that q(zo) is the absolute maximum of q(z) in |z| < 1. (We use 
Weierstrass’s theorem.) 

Putting z = zo in (Q), we have 


(4.73) q(zo + w) + qo — w) = 2g (Zo) + 2q(w). 
Putting 6 = dist (zo, |z| = 1) (© 0), by (4.73) we have, for |w| < 0, 
q (zo) + 4 Zo) = 2q (Zo) + 2q(w). 
Hence we have, for |w| < 0, 
(4.74) q(w) < 0. 
Putting w = z in (Q) and using g(0) = 0, we have, for |z| < +00, 
(4.75) q(2z) = 4q(z). 


By (4.74) and (4.75), we have q(z) < 0 in |z| < +o0. Hence we have | f(z)| < 
M|z|? in |z| < +00. Hence, we have f(z) = Cz”, where C is a complex constant 
with |C| < M. Thus the theorem is proved. 


Now we consider the functional equation (4.71). 


Theorem 4.31. Let f : C — C be an entire function that satisfies equation (4.71) 
for z,w € C. Then f(z) = cz, where c is a complex constant. 


Proof. Define g : C > R by 

q(z) = |f@)I?. 
Then (4.71) becomes (Q). By Theorem 4.1, g(z) = B(z,z), where B: Cx C—>R 
is biadditive. By following the proof of Theorem 4.30, we see that 


If(@OP =q@ =alzl?, a>. 


This proves the theorem. 


Note that in both the theorems we are using the fact that if f, g : C > C are 
entire functions such that | f(z)| < m|g(z)|, where m > 0 is a real constant, then 
f(z) = cg(z) for a complex constant c. 


Result 4.31a. (H. Haruki [348]). If f : C — C is an entire function and satisfies 
equation (4.71), where z € C and w is real (or purely imaginary), then the solution 
of (4.71) is cz, where c is an arbitrary complex constant and only this. 


Result 4.32. [348]. [f the entire functions f, g,9, w : C > C satisfy the functional 


equation 


(4.76) If + w)| + 1g — w)| = 2/e@)|+2lyW)| forz,w €C, 
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then all solutions of (4.76) are the following and only these: 

@ f@)=4, g@) =), 9@) =c, w@) =4 with |a| + |b] = alee ld\), 

Gi) f@) =@z +b), g@) =e (az +c), o@) =e (az + BE)", 
w(z) =e! (az + ag 


where a # 0, b,c are arbitrary complex constants, and 6; (i = 1,2,3) are real 
constants. Compare this with Theorem 4.28. 


4.10 Summary by Stetkaer [750] 


4.10.1 The Quadratic Functional Equation 


A norm || - || on a vector space V stems from an inner product (-,-) on V (ie., 
\|x ||? = (x, x) for all x € V) if and only if it satisfies the parallelogram identity 


(JvN) lx + yll? + lx — yl? = Ill? +2Ilyll?, x,y eV. 


This is classical knowledge, going back to the theorem of Apollonius in Euclidean 
geometry: The sum of the squares of the lengths of the two diagonals of a parallel- 
ogram is equal to the sum of the squares of the lengths of the four sides. A special 
case of Apollonius’s Theorem, is Pythagoras’s Theorem, which asserts the same for 
rectangles. Jordan and von Neumann [398] proved the result for vector spaces. 

Generalizing the parallelogram identity from a vector space V to a group G, we 
are led to the quadratic functional equation 


(Q) f(xy) + fy!) =2f@)+2f0), x.y EG, 


where f : G — H is to be determined. We allow G to be any group and the range 
space H of f to be any Abelian group. 

To describe the results about the quadratic functional equation on groups in a 
short way, we introduce the following (nonstandard) terminology. 


Definition. Let G be a group and H be an Abelian group. We will say that a map 
f : G — H 1s a quadratic function if there exists a symmetric bimorphism B : 
G x G = H such that f(x) = B(x, x) forall x € G. 


We choose the word function to avoid confusion with earlier works in which a 
quadratic form or a quadratic functional by definition is any solution of the quadratic 
functional equation. Any quadratic function is a solution of the quadratic functional 
equation (Q), but the converse is not true in general even for H = C. 

The basic result for Abelian groups is the theorem below. It is due to Aczél 
[40, 44]. In Kannappan [471, Result 1], it is mentioned that its assumption of G 
being Abelian can be replaced by Kannappan’s condition (K). 


Theorem. [40]. Let G be an Abelian group and let H be an Abelian group in which 
every equation of the form 2x = h € H has one and only one solution x € H. Then 
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any solution f : G — H of the quadratic functional equation on G is a quadratic 
function. 


Motivated by the original situation (JvN), Kannappan [471] studied the equation 
(Q) when G is the additive group of a linear space, to find out when solutions arise 
from bilinear functionals. So there is not only the group structure to take into account 
but also the multiplication by scalars. For literature about the vector space situation, 
we refer to [44, 471] and the recent monograph [187] and their references because 
here we shall discuss the pure group case, where the group may even be non-Abelian. 

Kannappan [471] also finds equivalent forms of the quadratic functional equation 
when the solutions f : G — F are defined on a 2-divisible group G, take their values 
ina field F of characteristic different from 2, and satisfy Kannappan’s condition (K). 
In this setting, he solves certain generalizations of the quadratic functional equation. 

The key to the studies on a non-Abelian group G is the following observation. 


Observation. If f : G — A is a solution to the quadratic equation, then its Cauchy 
difference Cf satisfies Jensen’s functional equation (J) in each of its variables when 
the other variable is fixed. 

In a recent preprint, Friis and Stetkaer [293] showed the following results: 


1. Any solution of the quadratic functional equation on a group G is a function of 
G/{G, G’], as is the case for solutions of Jensen’s functional equation. 

2. If [G, G’] = G’ holds, which it does for certain non-Abelian groups such as 
GL(n, R), n > 2, then all solutions of the quadratic equation on G are quadratic 
functions. 

3. A solution of the quadratic equation on a product of groups satisfies Kannap- 
pan’s condition if and only if its restriction to each of the subgroups satisfies 
Kannappan’s condition. 


5 


Characterization of Inner Product Spaces 


The inner product space, characterization of i.p.s., parallelepiped law and generation, 
different characterization of i.p.s. and its generalizations, and different orthogonality 
are considered in this chapter. 

We repeat the customary definition of an inner product space. Recall that a linear 
(vector) space V is an inner product space (i.p.s.) provided there is a mapping (called 
the inner product (i.p.)) (, ): V x V > F, a field (F = R or C depending on 
whether V is a real or complex vector space) satisfying the following conditions: 


(i) (v, v0) > O with equality if, and only if, o = 0 foro € V; 
(ii) symmetry: 


(u,v) = (v,u) (real symmetry) in the real case; 


(v,u) = (u,v) (conjugate symmetry) in the complex case for u,v € V; 
(iii)additivity in each variable: 


(u+ov,w) = (u, w) + (v0, w); 
(u,v +w) = (u,v) + (u,w) foru,v,w € V; 


(iv) homogeneity: 


(Au, v) 
D 


(u,v), AERorC, uve V; 
(u, Av) u,v 


=A 
=J(u,v), AEC, uve V. 

The inner product gives rise to a norm ||-|| (metric) on V given by ||o|| = /(v, v), 
leading to a normed linear space (n.l.s.) V. What is of interest is the converse ques- 
tion. In the class of n.Ls., the i.p.s. are particularly interesting. It is well known 
that most natural geometric properties may fail to hold in a general n.1.s. unless the 
space is an i.p.s. It is not without interest to determine when a linear metric space or 
n.l.s. V is isometric with a Hilbert space or an i.p.s. In other words, when can one 
define in it a bilinear symmetric i.p. from which its given metric can be derived in 
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the customary way? One might then wish to investigate suitable conditions on the 
n.l.s. V that can be imposed on the norm under which the given norm on V could 
have arisen from an inner product. Numerous basic characterizations of i.p.s. under 
various conditions were given starting with Fréchet [291], (typical classical result of) 
Jordan and von Neumann [398], etc. Conditions have been found with motivations 
that are highly geometrical, and others are primarily algebraic. There is an interest- 
ing survey book by Amir [73] that lists quite a number of characterizations. Since 
then, many new characterizations have been found. Some of them have been given 
in terms of the functions (normal derivatives) (Amir [73], Alsina and Garcia-Roig 
[70]) ph. : V x V > R defined by 


2 2 
: x+t — ||x 
inte —ore. 
~~ t>0+ t 


height functions [70] ht: V x V > Rby 


Iv? — ph. @, y) 
ee eS), 
lx — yll 


etc., and also in terms of the classical orthogonal relations. 

Numerous equivalent definitions of orthogonality of elements of abstract 
Euclidean spaces can be given (Amir [73], James [389]). The most natural is the 
Pythagorean relation: x L? y whenever ||x + y||? = ||x|I? + |Ly|l?. 


hix,y)=yt+ 


Other Orthogonal Relations 


Blaschke-Birkhoff-James orthogonal relation: x 1® y whenever ||x|| < |x + ty|| 
forallt ER, x,y eV. 


Robert’s relation: x L® y ~ ||x +ky|| = |x —ky|| forallk ER, x,y € V. 


James relation (isosceles orthogonality): x ’ y whenever ||x — y|| = ||x + yl]. 


The concept of orthogonality has certain desired properties, the most important 
being that every two-dimensional subspace contains nonzero orthogonal elements. 

Broadly, characterizations fall into (i) equalities, (ji) normal derivatives, 
(iii) approximations, (iv) various orthogonality conditions, (v) inequalities, (vi) func- 
tional equations, etc. Item (v) inequalities will be treated later, in Chapter 16. 

The usual arguments for the characterizations of i.p.s. deal with rather sophisti- 
cated geometric properties of a unit ball (smoothness, etc.). A functional approach 
favours direct arguments, and some of these may be valid in the case of vector spaces 
over fields other than the real field. 

Functional equations are instrumental in many characterizations. It is fruitful to 
look at the matter from this point of view. One of the objectives of this chapter is 
to bring out the involvement of functional equations in various characterizations of 
i.p.s. (Aczél [44], Dhombres [218], Day [214], Rassias [684]). 

We start off with a result of Fréchet [291]. In Theorems 5.1 and 5.2, Fréchet 
discovered a necessary and sufficient algebraic condition from which he derived this 
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more abstract criterion: An n.l.s. is an i.p.s. if, and only if, every three-dimensional 
subspace V’ of V is isometric with a Euclidean (three-dimensional) space. The 
next interesting movement forward was from Jordan and von Neumann [398], who 
described another necessary and sufficient algebraic condition that implies that 
Fréchet’s abstract condition can be weakened as follows: An n.l.s. is an i.p.s. if 
and only if every two-dimensional subspace V’ of V is isometric with a Euclidean 
(two-dimensional) space. The basic algebraic (norm) condition that results from the 
above that makes the n.l.s. an i.p.s. (and the oldest prototype of all these results) 
is the parallelogram identity, also known as the Jordan—von Neumann identity (or 
Appolonius Law or the norm equation) 


(JvN) lx + yll? + Ix — yl? = 2ix|? +2Ily? forx,y eV. 


This translates into a well-known functional equation known as the quadratic func- 
tional equation, 


(Q) q(x +y)+q( — y) =2q(x)+ 2q(y) forx,y € V, 


studied in Chapter 4. (Because of this fact, most of the computations centre around 
(JvN) and (Q).) 

In a real two-dimensional space, it is well known that the space is an i.p.s. if and 
only if the set of points of norm one is an ellipse [212] (a geometrical characteriza- 
tion). 

First we consider Fréchet’s result and then follow it with the Jordan—von 
Neumann result. 


5.1 Fréchet’s Equation 


The functional equation resulting from Fréchet’s condition [588, 464, 463, 465, 580] 
iS 


(F) 
p(xtyt+z) = pxt+y)—p()+py+z)—p(x)+p(zt+x)—p(y) forx,y,z€G. 


We obtain the solution of (F) on a general setting. 


Theorem 5.1. (Kannappan [465, 471]). Let G be a group (not necessarily Abelian), 
Han Abelian group, and p : G — H satisfy the functional equation (F). Then p 
satisfies 


(5.1) 8p(x) = 4A(x)4+ B,x) forx eG, 


where A: G — H is additive and B : G x G — H is biadditive. 


Proof. Set z = 0 in (F) to get p(0) = 0. 
With z = —(x + y) in (P), (F) gives 


p(x +y)— p(-@ + y)) = p(x) — p(—x) + p&Q) — p(-y); 
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that is, 
(5.2) A(x) = p(x) — p(-x) forx eG 
(satisfies (A)). Set z = —y in (F) to obtain 
pQty)+ py +x) =2p(x)+ pY)+ py) forx,yeG 


(similar to (4.22)). Replacing x by —x and adding the resultant to the equation above, 
we have 


(5-3) q@+y)+q(—-x + y) =2q(@~)+2q(y) forx,y eG, 
where 
(5.4) q(x) = p(x) + p(—x) forx €G. 


Remark. If G is Abelian, (5.3) is (Q) and (5.4) and (5.2) give (5.1). 

Note that (0) = 0, q is even, g(2x) = 4q(x),q@@ + y) =q(y +x), and 
(5.5) 2p(x)=q(x)+ A(x), forx EG. 
Since p and A are solutions of (F), so is qg; that is, 


qa+yt+z=qe+y)-q@)+4a0+2) 


(5.6) 
—q)+q(z+x)-—q(y) forx,y,z EG 


(see (4.5)). 
Define B: G x G > H by 


(5.7) Bix, y)=qx+y)—q@—y) forx,yeG. 
From (5.6), (5.7), g(x + y) = g(y + x), and that g is even, we have 
Baty, zy=qa+yt+z)-—qaut+y—-2z) 
= qQ+2)—¢Q —2) + 9¢@4+%)—q@-z+4) 
= B(x,z)+ By, 2); 


that is, B is additive in the first variable. By the symmetry of B, it follows that B is 
biadditive. 

From (5.7) and (5.3), it is easy to see that 4g (x) = B(x, x), and then from (5.5) 
we obtain (5.1). This proves the theorem. 


Remark. Historically, (Q) is connected to i.p.s. Here again we obtain (Q) as follows. 
Changing y to —y in (5.4) and using that g is even and g(x + y) = q(yv + x), we 
get (Q) from (5.3). 


Remark. If H is a 2-divisible Abelian group, then (5.1) becomes p(x) = A(x) + 
B(x, x) (see also Theorem 4.29 and (4.27)). 
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Now we consider a generalization of (F): 


pPaty+z=rat+y)—-rz)+siy +z) 


C1) 
—s(x)+t(z+x)-—t(y) forx,y,z EG. 


Theorem 5.1a. (Kannappan [470]). Let G be a group (not necessarily Abelian) and 
Han Abelian group. Let p,r,s,t : G — H satisfy the functional equation (F1). 
The most general solution of (F1) is given by 


16p(x) = 16(r(x)—a) = 16(s(x)—b) = 16(¢t(x)—c) = 4A(Xx)4+- B(x, x) forx €G, 


where A is additive, B is biadditive, and a, b, c are constants. 
Proof. Set x = y = z = 0 in (F1) to get p(O) = 0. Now put y = z = O in (F1) to 
obtain 


(5.8) p(x) =r(x)-a-s(x)+b+4+t(x)-c forx eG, 


where r(0) = a, s(0) =b, t(0) =c. 
From (5.8) and (F1) with z = 0, we get 


a(x +y)=a(x)—a(y) forx,y EG, 


where a(x) = p(x) —r(x) +a. 
Obviously, a(0) = 0 and a(y) = —a(y) (put x = 0 above); that is, 2a(y) = 0. 
Hence 
2p(x) = 2(r(x) — a). 
Similarly, we get 2p(x) = 2(s(x) — b) = 2(t(x) — c). Thus 2p(x) is a solution of 
(F), and from Theorem 5.1, we have 


16p(x) = 4A(x) + B(x, x), 


where A is additive and B is biadditive. This proves the theorem. 


Remark. If H is a 2-divisible Abelian group in Theorem 5.1a, then 


D(Qx)=r(x)-a=s(x)-—b=t(x)-—c = A(x) + BOX, x). 


5.1.1 The Parallelepiped Law 
Theorem 5.1b. Any Banach space B (over R) that satisfies 
(5.9) lx ty +21? = lle + yl? + lly + zl? + lz + ll? = Mell? = My? = ell? 


forall x,y,z € Bis a Hilbert space (i.p.s.) and conversely. 


Proof. The converse is easy to verify. Let us assume that (5.9) holds. Define q : 
V — R by q(x) = ||x||*. Then g(0) = 0, gq is even, and q satisfies (5.6). Then, by 
Theorem 5.1, we have 
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q(x) = B(x,x)+ A(x) forx EV, 
where B is biadditive and A is additive. Since g is even, A = 0, so that g(x) = 


B(x,x). 
Note that 


B(x, y) = sla@+y)—¢qx)-4Q)] 


Ig(x+y)—q@—y)] 


NIL BILRNIR 


= x(x + yl? — [xll? —Ilyl?) forx,y € V. 
In general, B is biadditive only. Here B is bilinear. Indeed, fort,s €¢ R, x,y € V, 


IItx + yl] = I —s)x + sx + yll < It — s| [lel] + Ilsx + yl 


or 
| tx + yll — sx + yl] < [¢— | [x]. 


Let r, € QO such thatr, — t asn — oo. Then 
| lrnx + yl] — lltx + yl] < Irn — ¢] [lx || > 0 as n > 00; 


that is, ||7,x-+y|| > ||tx+y|| asr, > t forx, y € V. Note that since B is biadditive, 
B(rnx, y) = rn B(x, y), forrn € Q, x, y € V. Now 


rn B(x, y) = B(rnx,y) 
1 
= 5 |llrux + 9? = trax? — It? ] 
1 
> 5 [tex + yl? = lex? — Hy? ] as rn >t 
= B(tx, y). 


But then r, B(x, y) > tB(x, y) as r, — t. Hence, B(tx, y) = tB(x, y); that is, B 
is bilinear and so V is an i.p.s. 


If we use B(x, y) = Z[llx + yll? — lx — yl], then, forr, € O, ER, 
mB(x,y) = Birnx, y) 
1 
= 5 [Ura + yl? — trax — yl? ] 
— 2 [hie + yl? = tex — yP t 
4 y x—y|| as rn > 
= B(tx, y). 


This completes the proof. 
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5.1.2 Parallelogram Identity 


Next in line is the Jordan—von Neumann result [398]. By putting z = —y in (5.9), 
they obtained the identity (JVN), showed that the new condition is as strong as equa- 
tion (5.9), and proved Theorem 5.2. 


Theorem 5.2. (Jordan and von Neumann [398]). An n.i.s. V over R satisfying (JVN) 
must be a Hilbert space (that is, an i.p.s.). 


Proof. Suppose (JvN) holds. Define g : V > R by q(x) = ||x||*. Then q satisfies 
(Q), and by Theorem 4.1, g(x) = B(x,x), where B is a symmetric, biadditive 
function. Then use the argument in Theorem 5.1b to show that B is bilinear. Q.E.D. 


Subsequent authors have found norm conditions weaker than (JvN) that require 
a Banach space to be an i.p.s. Hereafter the central equation will be (Q); that is, most 
of the characterizations will follow from the parallelogram identity (JVN) and (Q). 
In a way, the equation (Q) arises in many characterizations of 1.p.s. 

Consider the three-dimensional parallelepiped identity (ROhmel [708], 
Kannappan [470]) 


lx ty +21? + lx ty — 217 + lx —y + zl)? + lle — yy — ell? 
= 2(\|x + yl]? + lx — yl? + [lx +21? 
+ |x = 2lI? + lly + 2lI? + lly Il?) 
— A(x]? + lly? + lz?) forx, y,z € Vannls. 


(5.10) 


Put y + z = 0 in (5.10) to get 
(5.11) [2x + zl? + |l2x — zl]? = 4Ilx + zil? + 41x — zil? — 6llz|l”. 
Replace z by 2z in (5.11) and use (5.10) to have 


Aljx + z\|? + 4llx — zl? = 4llax + 2z\I* + 4x — 2z1I* — 24llziI* 
= 16||x + zll* + 16] — zll? — 24l|zll* — 241", 


which is (JvN). Hence an n.].s. satisfying (5.10) is an i.p.s. 


As a generalization (functional equation version) of (5.10), we treat the func- 
tional equation 


dxt+yt+z)t+oat+y-—z)+o@—-—ytz2)+¢@-y-z) 
=2¢@+y)+b@—y)+o@+z2)+¢@ —2z) 
+ o(y +z)+ 4 —z)) —4(¢@@) + (1) + 6@)) 


forx,y,z EV, 


(5.10a) 


where ¢ : V > Rand V is ann_Ls. over R [470]. 


Theorem 5.3. [470]. Suppose ¢ : V — R satisfies the functional equation (5.10a). 
Then q(x) = $(2x) — 16¢(x) satisfies (Q). 
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Proof. First z to —z in (5.10a) give #(z) = ¢(—z); that is, g is even, and then z = 0 
in (5.10a) gives #(0) = 0. 
Let z = y and z = x + y separately in (5.10a) to obtain 


(5.12) d6&@&+2y)+@—2y) =4[O6+y)+o—y)]—-66(x)-8f()+2¢Qy) 
and 


p(2x + 2y) + (2x) + by) + 21b@ + y) — d@ — y)] 
+ 2[p(x) + (y)] — 216@ + 2y)+ ¢2x+y)]=0 forx,yeV. 


Replace y by —y in (5.13), add the equation thus obtained to (5.13), and use (5.12) 
twice (once with x and y interchanged) to get 


p(2x + 2y) + d(x — 2y) — 22x) — 26Q2y) — 164 + y) 
— 16¢(x — y) + 32¢(x) + 32¢(y) =0 forx,yeV, 


which transforms to (Q) by defining g(x) = (2x) — 16¢(x). This proves the 
theorem. 


Remark. The solution of (5.10) can be obtained from (5.10a). 

Suppose (5.10) holds in V. Define ¢(x) = ||x||? for x € V. Then ¢ satisfies 
(5.10a) and we get q(x) = —12'||x||?; that is, the norm satisfies (JVN) and hence V 
is ani.p.s. 


(5.13) 


5.1.3 More on Fréchet’s Result 


Putting successively x = u +20, y = v andx =u-+o, y = v in (JvN) and 
subtracting the latter from the former, we obtain 


(5.14) \[e + 30||7 — 3]]u + 20]|7 + 3]lu + oll? — |lull? =O foru,o EV. 


This equation is a special case (k = 3) of 
x(k 
2 ( Jeo + joll? =0 foru,v eV, 
cay NS 
j=0 
treated by Johnson [396] (see also [708]). The known proof that (5.14) implies V is 


an i.p.s. is based on Fréchet’s result [284] that if a continuous function g : R > R 
satisfies 


(5.15) -e (‘) (-1)" /g(r+js)=0 forr,seR 


j=0 


(that is, the (n + 1)th-order difference of g is zero), then g is a polynomial of 
degree at most n. Here, we give a simple and direct proof in Theorem 5.4 that (5.14) 
implies that V is an i.p.s. and obtain the general solution of (5.15) forn = 3 without 
assuming any regularity condition on g in Theorem 5.4a. 
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Theorem 5.4. Ann.l.s. V satisfying (5.14) is an i.p.s. 
Proof. Replacing u by u — v and u by u — 20 separately in (5.14), we get 


lu + 2v||* — 3|lu + ol]? + 3]lull? — Iw — ol]? =0, 
llu — 2v]|? — 3l]u — oll* + 3|lull? — jw + oll? =0. 


Adding the two equations above, we obtain 
(5.16) lu + 2o||? + lu — 2v|? = Alu + vl? + 4llu — oll? — Olu. 
We have after replacing 20 by v in (5.16) 

lle + ol? + [lu — ol? = [2 + ol? + [Qu — oF? — oul’, 
which, by using (5.16) again (this time interchanging u and v), gives 


lu + oll? + lu — ol]? = 4llu + oll? + 4]Ju — of? — 6lo|!? — 6a]? 


that is, (JVN). This proves the theorem. 


Now we turn to finding the general solution of (5.15) for n = 3. First note that 
the case n = 2 reduces to Jensen’s equation and that g(x) = A(x) +d, where A is 
additive; that is, g is a “polynomial” of degree at most 1. 

Equation (5.15) with n = 3 yields 


g(x + 3y) -—3g(4+2y)+3g(a+y)— g(x) =0 forx,y ER. 
Theorem 5.4a. Suppose f : V > R has all its third-order differences zero; that is, 
f satisfies 
(5.14a) fut3y)-3f(lu+2v0)+3f(tu+ov) — flu) =0 foru,ve V. 
Then f has the form 
(5.17) f@)=d+A(x)+q(Qx)=d+AQX)+BCx,x) forx eV, 


where d is a constant, A is additive, q satisfies (Q), and B is symmetric and biaddi- 
tive; that is, f is a “polynomial” of degree at most 2. 


Proof. Replacing u by x and v by x — y in (5.14a), we get 
(5.18) f(x + 2y)=3f@+ty)+fa—y)—3f@), x yeV. 


Interchange x and y in (5.18) to get 


(5.19) fQx+y)=3fe+y+fo-—-x)-3f() forx,yeV. 
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With y = 0, (5.19) yields 

(5.20) f(x) =3f(«)+ f(—x)-—3f(@) forx eV. 
Replacing x by 2x in (5.19) and using (5.19) and (5.20), we obtain 


(5.21) 2faty)+fa-y)+fO—x) = 3f@)+F(—x)4+3f + F(-y)—4F 0). 


Changing y into —y in (5.21) and subtracting the resultant equation from (5.21), we 
have 
J(y +x) + J(y —x) = 2J(y), 


which is Jensen, where 
J (x) = f(x) — f(-x). 
Since J is odd and J (0) = 0, we get 
(5.22) f(x) — f(—x) = A(x), where A is additive. 
Substituting (5.22) into (5.21), we have 


2f (x+y) +2f@ —y) —AG —y) =4F(%)— AG) +470) — AQ) —47 0): 
that is, 

q(x +y)+q@ — y) =2q(x) + 2q(y), 
where 


1 
q(x) = f&) — 5AG) — FO) 


(q satisfies (Q)). Now (5.17) follows from Theorem 4.1. This completes the proof of 
this theorem. 


Remark. The general solution of (5.21) can be obtained from [241] by identifying 
2f = fi, f@)+ f(-x) = fa), and 3 f(x) + f(—x) —2f (0) = A) = fa). 


Remark. If in Theorem 5.14a V = R and f is continuous and satisfies (5.14a), then 
f is a polynomial of degree at most 2 (see Fréchet [288, 290]). 


5.1.4 More Characterizations 
Result 5.5. (Amir [73]). Let 6 : V> > Rand V ann.Ls. over R be such that 
(5.23) 6@, y,z) = lx + y +217 + llx+y —2I? - lx —y +21? - lx -y—2ll?, 


for x,y,z € V, is independent of z. Then V is an i.p.s. 


Proof. Set z = x + y and z = 0 to have 


Allx + yl? — 4Ixll? — 4Ily ll? = lx + yl? — 2ikx — yl’, 


which is (JvN). So, V is ani.p.s. 
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Theorem 5.5a. Suppose ¢ : V3 > R,q: V > R, and V ann.Ls. over R are such 
that 
oe. y,2J=qaetytt+qe@ty—z 


(5.23a) 
ge = yp Fz) = ¢e— 9 =2),. Jorn yz V3 


is independent of z. Then there is an additive A : V — R and biadditive B : V7 > 
R such that 
q(x) = B(x, x) + A(x) +b) forx eV. 


This is a functional equation version of (5.23). 
Proof. z = x + y and z = x — y in (5.23a) yield 


q(2(x + y)) + q(0) — g(2x) — q(-2y) 
= q(2x)+q2y)-—q2(x—y))—qQO)  forx,yeV; 
that is, 
q(x + y) + q(e — y) = 2q¢(x*) + g(y) + 4(—y) — 2¢(0), 


which is (4.22). Thus there exist a biadditive B and an additive A (Theorem 4.22) 
such that 
q(x) -— q(0) = B(x, x) + A(x) forx eV. 


Remark. If g(x) = ||x||* in Result 5.5a, then g(0) = 0, A = 0 (since g is even), g 
satisfies (5.23), and g(x) = B(x, x). So, q satisfies (JVN) and V is an i.p.s. 


Theorem 5.6. [73]. Let V be ann.l.s. and ¢ : R > R be continuous with (0) = 
0, dC) = 1 and satisfy 


(5.24) P(llx + yD + A(x — yIl) = 20 (lx I) +2¢(lyI) forx,y eV. 
Then V is ani.p.s. 
Proof. From (5.24), it is easy to check that J(n||x||) = n?(||x|]) forn € Z4, x € 
V. Replacing x by #, we see that $ (2IxI]) = 2b (x[l) or Arla) = r2a(lxID, 
r positive rational, and x € V. Since ¢ is continuous, $(t\|x|]) = t7¢(|lx|l), t > 
0, t € R, x € V. Choosing x with ||x|| = 1, we get A(t) = t?4(1) = #7 since 
#(1) = 1. So, (5.24) becomes (JvN) and V is an i.p.s. 

This could also be achieved as follows. Define g : V > R by 


q(x) = (xl) forx ER. 


Then q satisfies (Q), g is continuous, and there exists a symmetric, biadditive B : 
V x V = R such that 


q(x) = B(x, x), 


1 
B(x, y)= 5 ae +y)—qax—y)), forx,y eV. 
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To prove the bilinearity of B, notice that g is continuous and B(rx, y) = rB(x, y), 
r € Q (see Theorem 5.1b). 
Thus V is an i.p.s. 


Result 5.7. (Sundaresan [780], Rassias [679], Amir [73]). Let V be ann.l.s. Suppose 
forn > 2 that 


n n 
(5.25) > Ii — FIP? = SP [all — ne? 
i=1 i=l 


nh 
holds for x1, X2,...,Xn € V withx = 1 > xj. Then V is an i.p.s. 
i=l 
Here (JN) is used. 


Next we will prove a functional equation analogue of (5.25) that will cover 
Result 5.7. 


Theorem 5.7a. (Kannappan [466]). Let V be a linear space over Rand ¢, yy :V > 
R satisfy the functional equation 


(5.25a) >) oa —¥) = >) 60a) — ny &) 
i=1 i=1 


for X1,%X2,...,Xn € V andx = i > x; for a fixed n > 3. Then 


n 4 
i=1 


p(x) = q(x) + AG) +e = BA, x) + AQ) +e, 


(5.26) 
v(x) = Br, x) + A(x), 


for x € V, where q satisfies (Q), A satisfies (A), B is biadditive, and c is a constant. 


Proof. Set xj = x2 = +--+ =X, =x in (5.25a) to get 
(5.27) w(x) =¢(x)—¢(0) forx eV. 


Now (5.25a) can be written as 


n n 
(5.28) >) oi —¥) = >) di) — nd) +2600). 
i=l i=l 
First let n be even, say 2k. Put xy = --- = xp =X, X41 = = XR = yin 


(5.28) to have 
(5.29) (=) +¢ (>) = $(x) + $(y) - 24 (=) +260). 


Next, let n be odd, say 2k + 1. Setxy = --- = xp =X, Xey1 = = XR = 
Y, X2e¢1 = >> in (5.28) to obtain 
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ko (=) + kp (>) + ¢(0) 
= kets) +0) +6 (F*) - ck + 00 (*) + ck + nee, 


which is (5.29). Now we solve (5.29). 
Replacing x by 2x and y by 2y in (5.29), we have 


(5.30) 2o(x + y) + O@ —y) +40 —x) = $(2x) + oy) + 26(0). 
Define 
(5.31) 2q(x) = o(x) + d(—x) — 260) forx EV, 
so that (5.30) can be rewritten as 

$(2x) + dQy) = 2q(x — y) + 26 + y); 
that is, 
(5.32) dtutv) + —v) =26(u)+2q(v) foru,v eV. 
Note that q is even and q(0) = 0. Change u to —u in (5.32) to yield 
(5.33) p(—u + v) + b(—u — v) = 24(—u) + 2q(0). 
Adding (5.33) and (5.32) and utilizing (5.31), we obtain 
(Q) qu +v) +q(u — v) = 2q(u) + 2q(0); 


that is, g is quadratic so that g(x) = B(x, x), for a biadditive B, by Theorem 4.1. 
Subtracting (5.33) from (5.32), we get 


A(u+v)+A(u—v) = 2A(u), 
which is Jensen, where 
(5.34) Alu) = du) — d(—u) forue V. 


Note that A(0) = 0 and A is odd. Thus A is additive. Adding (5.31) and (5.34) and 
from (5.27), we obtain the result sought, (5.26). This proves the theorem. 


Remark. In Result 5.7a, ¢ and y are obtained without assuming any regularity 
condition. It is easy to see that Result 5.7 is an easy consequence of Result 5.7a. 


Theorem 5.7b. (Kannappan [466]). Let V be an n.l.s. Suppose ¢, y : Ry > R 
satisfy the functional equation 


(5.25b) > ddl — FI) = >) ddlaill) — ay (FD 


i=1 i=1 
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n 
for X1,%X2,...,Xn € V andx = 4 > x; for a fixed n > 2. Then 


i=l 
(5.26a) P(x) = B@,x)+c, ywllx|l)=BO,x), forxeV, 
where B is biadditive and c is a constant. 


Proof. As in Result 5.7a, we consider two cases, where n is even and n is odd. 


Case 1. Letn = 2k. Set xy = x2 = +++ = XR =X, Xe4 = Xe42 Ss HS XQE = 
in (5.25b) to get 
IIx — yll Ix + yl 
2¢ (a = A(x) + Allyl) —2y | —— Js 
that is, 


(5.35) 26(Ix — yl) = dll) + PCy) —2y(lx + yll) forx,y eV, 
and 
(5.26) P(Ikxll) — w(lkxll) = GO) (put y = x in the above), 
and so 

2p(\lx + yll) + 2A(llx — yl) = PCI) + d(l2yll) +246@)  forx,y € V. 
Case 2. Let n = 2k + 1 be odd. 


Let x) = x2 =+:+ =X =X, Xke] = ces = XR = y, and x94) = > in 
(5.25b) to obtain 


26 (FT) 600) = coax rkotiy ed (AS) —@e+ny (=) 


and (5.27), and thus (5.35). 

Now we solve (5.35). Set x + y =u, x — y =v, and q(x) = G((|xl|) — d(0) 
in (5.35) to have (Q) satisfied by g. Then, from Theorem 4.1, it follows that g(x) = 
B(x, x) for x € V, where B is biadditive, and then (5.26a) holds. This proves the 
result. 


Theorem 5.7c. [466]. Let V be ann.l.s. over R, g : Ry — R be continuous, and 
h:R4— R satisfy the functional equation 


(5.25c) > 8 (lx: — FI?) = D7 g (lai?) — nh I?) 


i=1 i=1 


n 
= 1 ; ; 
forn > 2, x1,%2,...,%n € V, X = = > xj. Then V is anip.s. 
i=l 
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Proof. If g is a solution of (5.25c), so is g —c. So, without loss of generality, assume 
g(0) = 0. We can use either Result 5.7b or Result 5.7a (see [466]) to prove this 
result. We use Result 5.7b. Define ¢, y : V > Rby 


P(lxll) = g(lxll?), wdlxll) =ACixlI?), x eV. 
Then (5.25c) becomes (5.25b), and by Result 5.7b we get 
e(llxll?) = A(lxll) = Bx, x) = w(x) = A(x) forx eV 


since c = Oand A = 0 (since ¢ is even), where B is asymmetric, biadditive function. 
Now g satisfies 


g(llx + yll?) + g(x — yl?) = 2g(Ilx ll?) + 2e Cyl) 


forx,y eV. 

By induction, it follows that g(|lnx||?) = n7g(\|x|I*), g(r2t*) = r2g(t?), for 
r € Q, t € R4, and by continuity g(p*t*) = p*g(t?) for any p,t € R4; that is, 
g(p?) = cp” for some constant c, p € R,. Then the norm || - || satisfies (JVN) so 
that V is ani.p.s. Hence we get the result. 


Next we consider the functional equation 
(5.36) d(t)= f(«t+ty) forx,yeV,teER. 


In Amir [73] and R6hmel [708], it is proven that if O(f) = |lx + ty ||? forx,y € Van 
n.l.s., and ¢ € Ris a polynomial in ¢ of degree 2, then V is an i.p.s. Now we prove 
the following result. 


Theorem 5.8. (Kannappan [470]). Suppose that the function f : V — R is such 
that, for each x, y in V, the function ¢ : R — R given by (5.36) is a polynomial 
function of degree 2 in t; that is, there exist a(x, y), b(x, y), and c(x, y) such that 
(5.37) 

P(t) = ftx+ty) =a, y)t? +b(x,y)t+c(x,y), fort €Randx,yeV, 


holds. Then f and ¢ are given by 
(5.36a) ff) =q(x)+ A(x) +d = Bix, x) + A(x)+d forx eV, 
(5.36b) P(t) = q(y)t" + 2B, y) + AG) +(e) +A) +d 


fort € R, x,y € V, where A is additive, B is biadditive, q satisfies (Q), and d is a 
constant. 


Proof. Since ¢(t) is a quadratic function of t, we can write (5.37) as 
(5.38) P(t) = f(x +ty) =at? +bt+c, 


where a, b,c are functions of x and y. (Since we will not substitute anything into 
a,b,c as such, we don’t write their arguments x, y explicitly.) 
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Setting tf = 0 in (5.38), we get 
f(x) =c. 
Replacing t by —t in (5.38), we have 
(5.38a) f(x —ty) =at? —bt+e. 


Subtracting and adding (5.38a) and (5.38), we have 


(5.39) fQ +ty) — f@ —ty) = 2bt 

and 

(5.40) f(x +ty) + f(e —ty) = 2at? + 2c = 2at? +2f (x), 
respectively. 


Letting t = | and 2 separately in (5.39) and (5.40) and eliminating b and a, we 
obtain 


f@+2y)— f@ —2y) =2(fa+y)-— fe —y)) 
and 
f@+2y)+ f@—2y)=41fG +y)+ f@—y)] —6f(@), 
from which follows 
(5.14a) f@+2y)=3fety)+fa-—y)—3f) forx,y eV, 
which is (5.18). 
Now, from the proof of Theorem 5.4a, it follows that 
1 
f@)=4@)+ zAG@) + f(0), 


which is (5.36a). Note that t = | in (5.39) and (5.38a) give b and a by 


fat+y)— f@—y) = 2b, 
(5.41) fat+y)+ f@—y)-2f(@)=2a, forx,yeV. 


Now the form of g(t), (5.36b), follows from (5.37), (5.36a), (5.41), and Theo- 
rem 4.1. This completes the proof of this theorem. 


Corollary. Suppose V is ann.l.s. andx,y € V. Let ¢, F : R > R be such that 
f(t) = F(x + ty|l?) fort € R is a polynomial in t of degree 2. Then V is ani.p.s. 


Corollary. [470]. Suppose ¢,g : R — Rand g is continuous such that $(t) = 
g(||x + ty||) is a polynomial of degree 2 int for x, y € V. Then V is ani.p.s. 
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5.1.5 Some Generalizations 
5.1.5.1 Solution of a Generalization of the Fréchet Equation 


We now devote our attention to the study of the following functional equation that is 
a generalization of the Fréchet condition (F): 


p(x +y +z) = pi + y)— poz) + p3(y +2) 


(5.42) 
— pa(x) + ps(z+x)— poly) forx,y,z EG. 


We prove the following theorem regarding (5.42). 


Theorem 5.9. (Kannappan [471]). Suppose that p, pi: G > H (i = 1 to 6) satisfy 
the functional equation (5.42). Let G be a group (which need not be Abelian) and H 
an Abelian group. Then there exist additive functions A, Aj : G > H (i = 1,2,3) 
and a biadditive map B : G x G — H such that p, p; satisfy 


8 p(x) = 4A(x) + B(x, x) + 8c, 

8pi(x) = 4A(x) + B(x, x) — 8Ai (x) — 8(C1 + c2 —c — ba), 

8p2(x) = 4A(x) + Bx, x) — 8A2(x) — 8A3(x) + 8b2, 
(5.42a) 8 p3(x) = 4A(x) + B(x, x) — 8A3(x) — 8(c5 + 6 —c — ba), 

8p4(x) = 4A(x) + Bx, x) — 8A1(x) — 8A2(x) + 8b4, 

8ps(x) = 4A(x) + B(x, x) — 8A2(x) — 8(c3 + c4 —c — Bg), 

8po(x) = 4A(x) + B(x, x) — 8Ai (x) — 8A3(x) + 8bo, 


where c, bo, ba, b6, C1, C2, C3, C4, C5, Co are constants. 
Proof. Letting z = 0 in (5.42), we obtain 
p(x + y) — pia + y) + b2 = p3(y) — poly) + ps) — pa(x), 
where p2(0) = b2, which is a Pexider equation. So, there is an additive A; such that 


(p — pi)(x) = Aix) +1 +2 — bo, 
(5.43) (p3 — po)(x) = Ai(x) +1, 
(ps — pa)(x) = A(x) +2, 
where c1, c2 are arbitrary constants. 


Similarly, setting y = 0 and z = 0 separately in (5.42) results in the Pexider 
equations 


p(x +2) — ps(e +z) +6 = (pi — pa)(x) + (p3 — p2)(Z) 


and 
P(y +2) — p3(y +z) + ba = (pi — po)(y) + (ps — p2)(z), 
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respectively, with p4(0) = b4, po (0) = be, and the solutions 


(p — ps)(y) = Aa(y) +.¢3 + c4 — bo, 
(5.43a) (pi — pa)(y) = A2(y) +¢4, 

(p3 — p2)(y) = A2(y) +63, 

(p — p3)(z) = A3(z) +.¢5 + c6 — ba, 
(5.43b) (pi — po)(z) = A3(z) + ¢5, 

(ps — p2)(z) = A3(z) + co, 


where A2, A3 are additive and c3, c4, C5, C6 are constants. 
Now use (5.43), (5.43a), and (5.43b) to express p; to po in terms of p as 


pi(x) = p(x) — Ai(®) — ¢c1 —c2 + bo, 
ps(y) = p(y) — A2(y) — ¢3 — c4 + be, 
p3(z) = p(z) — A3(z) — cs — c6 + ba, 
p2(y) = p3(y) — Ao) — ¢3 
(5.42b) = ply) = A3s(y) = An) — cs = cs — cg + Oa, 
pay) = pi(y) — A2(y) — ca 
= ply) — A1Q) — Aa) — ca — C1 — 2. +, 
Po(x) = p3(x) — Ai@®~) - 1 
= p(x) — A3(x) — A1@®) — c1 — ¢5 — 6 + bg. 


Substitution of these p; to ps given by (5.42b) into (5.42) using that A,, Az, and A3 
are additive results in 


(F) p(x +y +z) = p(x +y)— p@) + pY +z) — p&) + p@+x)— pY) +e 
with p(0) = c, by =c—c3 —c5 —c56 + ba, D6 = C — C1 — C5 — C6 + ba, and 
cy +c2 +c4 = c3 +5 + ce. Thus, by Theorem 5.1 there exists an additive A and a 
biadditive B such that 

8 p(x) = 4A(x) + B(x, x) + 8c. 


Now the equation (5.42a) we seek is obtained from (5.42b) and the equation 
above. This completes the proof of this theorem. 


The aim now is to solve the equation (a generalization of (5.23) and (5.23a)) 
(5.23b) (x, y,z) = f@+yt+z)+e@+y—z)+h—yt+z)+k@—y—z2), 


with ¢ independent of z, forx, y,z € V. 


Theorem 5.10. (Kannappan [467]). Suppose ¢ : V* > R, and f,g,h,k:V > R 
satisfying (5.23b) does not depend on z for x,y,z € V. Then f, g,h,k are given by 
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f@) = —q(x) + As) + b1, 

g(x) = —q(x) + Asx) — Aix) + do, 
h(x) = q(x) + Ar) + bs, 

k(x) = q(x) + Ao) + A1@) + ba, 


(5.23c) 


for x € V, where q satisfies (Q), Ai, A2, and A3 are additive, and b,, b2, b3, and 
bg are constants. 


Proof. Since ¢ is independent of z, set z = x + y, 0, and —x — y separately in 
(5.23b) to obtain 


f(x +2y)+ g(0)+h@x) +k(-2y) 
(5.44) =faxt+ty)+eat+y)+h@—y) +k —y) 
= f(0)+2(2x +2y)+A(—2y)+k(2x) forx,y eV. 


Replace 2x, 2y by x, y in (5.44) to get 
(5.45) (f-syaty)=Kk-hA)@)+@—-kH)(-y) +1, 
where c; = f (0) — g(0), which is Pexider. Hence there is an additive A, such that 


(5.46) f@) — g@) = Ai@) +¢1, 
(5.47) k(x) —h(x) = Ai(x)+c2, forx € V, co =k(0)—h(O). 


From (5.46), (5.47), and (5.44), we have 
(5.48) 2f(x + y)+ 2h —y) = f(0) + fx +2y) + h(—2y) + h(2x). 
Let y = x in (5.48) and then replace 2x by x to yield 

2f (x) + 2h(0) = f(0) + f(x) + h(—x) + h(x). 
Now use the equation above in (5.48) to get 
(5.49) 2h(x — y) +h(x + y) +h(—x — y) =h(2x) + h(—2y) + 2h(0). 
Interchange x and y in (5.49) and subtract the resultant from (5.49) to have 
2A2(x — y) = A22x) — Ary), 

which is Jensen, where 
(5.50) A2(x) =h(x) —h(—-x) forx eV. 


Note that Az is odd, A2(0) = 0, and thus that A2 is additive. From (5.49) and (5.50), 
we obtain 


2h(x — y) + 2h(x + y) — Ar(x + y) = Ax) + hy) — Ary) + 2h(0), 
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which can be rewritten as 


2h(x + y) — Are + y) — 2h(0) + 2h — y) — Aa(x — y) — 2h) 


1 1 
=h(2x) - 7 422%) —h(0)+h(2y) - 7422) —h(0); 
that is, g satisfies the quadratic equation (Q), where 
1 
q(x) = h(x) - x2) —h(O) forx eV. 


Thus we obtain h, and (5.47) gives k. It follows from (5.41) that g(0) = 0 and q is 
even. 

Now we determine f and g. Now, letting z = x — y and —x — y separately in 
(5.23b), we get 


f (2x) + gy) +h — y)) + KO) = FO) + g(x + 2y) +h(—2y) + k(x), 
which by replacing 2x by x and 2y by y and using h and (5.47) becomes 
f@)+fO)+a@-y=fO+fA+y)+40)+4@); 
that is, 
F(x) + £0) —9@ + y) + 2g) + 24(y) = FO) + f&+y) +a) +4) 
or 
(f(x) +4) — FO) + FO) +40) — fO) = f@+y+a@+y)— FO), 
which is additive. Thus 
A3(x) = f@)+4q@)-fO), forxeV, 


is additive. Now we obtain f, g from (5.46). This completes the proof of the theorem. 


Remark. Note that the solution of (5.23b) is obtained without assuming any regu- 
larity condition on the functions. 


5.1.6 Some More Characterizations 
The functional equation 


(3.51) a(g(x)y + g(y)x) = g@)gO)g@ + y) 


related to (Q) has been studied by many authors in connection with a characterization 
of i.p.s. under some additional conditions satisfied by g (see [44, 218, 822, 471]). The 
following two results are proved in [822]. 
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Result 5.11. (Volkmann [822]). Suppose g : R4 — Rx is a continuous solution of 
the functional equation (5.51) for x, y € Ry with g(0) = Oand g(x) > Oforx #0. 
Then g(x) = cx’. 


Result 5.11a. [822]. The continuous solutions g : R — R of equation (5.51) for 
x,y € Rare given by g(x) = 0, g(x) =+1, g(x) = cx’, and 


cx”, x>0 0, x>0 
=f eee" g(x) = a ee where c > 0. 


c 


Result 5.11b. (Aczél and Dhombres [44], Dhombres [218]). Let g: R ~ R bea 
continuous solution of (5.51) for x, y € R with g(0) = 0, g(x) > O for x 4 0, and 
lim g(x) = 00. Then g(x) = cx?. 

=F CO: 


Here we obtain the continuous solution of (5.51) under different conditions first 
by a simple and direct method and then with the general solution by reducing it to 
the quadratic equation (Q). As a matter of fact, we prove the following theorem. 


Theorem 5.12. (Kannappan [471]). Let g : R — R be a continuous function 
such that g is 1-1 both on the nonnegative reals and the nonpositive reals; that is, 
{g/R+ : R+ > Rand g/R_ : R- — Rare injective}. Then g is a solution of the 
functional equation (5.51) holding for all x, y € Rif, and only if, 


(5.51a) g(x) =cx* forx ER, 


where c is a constant. 


Proof. Setting x = 0 = y in (5.51) gives g(0) = g(0)’. So, either g(0) = 1 or 
g(0) = —l or g(0O) = 0. 

Let g(0) = 1. Then x = 0 in (5.51) yields g(y) = g(y)? for all y € R. Hence 
g(y) = 0 or 1, contradicting that g is 1-1 on Ry. 

Now consider the case g(0) = —1. With x = 0 in (5.51), we get g(—y) = 
—g(y)* for all y € R; that is, g(y) = —g(—y)* = —g(y)*, again contradicting the 
one-to-oneness of g on Ry. 

Thus g(0) = 0. Taking y = —x in (5.51), we have g(—g(x)x + g(—x)x) = 0; 
that is, —xg(x) + xg(—x) = 0 for all x since g(0) = O and g is 1-1 on R* andR-. 
Therefore, g is even; that is, g(—x) = g(x) forx ER. 

First set y = x and then separately y = —2x in (5.51), and then use that g is 
even to obtain 


g(2xg(x)) = g(x)?g(2x) = g(—2xg(x) + g(2x)x). 


Since g is even and 1-1 on nonnegative reals, we can conclude either —2xg(x) = 
—2xg(x)+g(2x)x (that is, g(2x) = 0, for x 4 0, whichis not possible) or 2xg(x) = 
—2xg(x) + g(2x)x; that is, 


(5.52) g(2x) =27g(x) forallx ER. 
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Now put y = 2x and y = —3x separately in (5.51) and use that g is even and 
(5.52) to have 


g(2g(x)x + 4g(x)x) = 4e(x)*g (3x) 
= g(—3xg(x) + xg(3x)) forallx ER. 


As before, 6xg(x) = 3xg(x) — xg(3x), in which case g(3x) = —3g(x) for all 
x € R. With y = 3x in (5.51), we get g(3xg(x) + gx)x) = 16g(x)7g(3x) (that 
is, —48g(x)> = 0 for all x), which cannot be. So, 6xg(x) = —3xg(x) + xg(3x), or 
g(3x) = 3*g(x), forall x ER. 

By induction, it follows that 


(5.52a) g(nx) =n7g(x) 
for all integers n and x € R. Now (5.52a) yields g(=) = g(x) forn ¢ 0, and 
2 
g (=x) = (=) g(x) for rational ” and x ER. 
n n n 


We invoke the continuity of g to obtain (5.51a), where c = g(1) is a constant. 
This completes the proof of this theorem. 


Remark. [471]. Traditionally the solution (5.51a) has been associated with the 
quadratic equation (Q), so we will derive (Q) from (5.51) and the hypothesis of 
Theorem 5.12. Indeed, we prove Theorem 5.12 by linking (5.51) to the quadratic 
equation (Q). 

Note that g is even and that g(2x) = 4g(x). Changing y into y — x in (5.51), we 
have 


a(g(x)(y — x) +a(y —x)x) = g(x)g(y —x)g(y) 
= g(-g(x)y + a(y)x) (y to —y in (5.51)). 


Now, since g is even and 1-1 on R;, we can conclude that g(x)(y — x) + 
xg(y — x) = g(x)y — xg(y) (that is, xg(y — x) = x(g(x) — g(y)) forall x, y € R), 
which for y = 2x gives xg(x) = x(g(x) — 4g(x)) (that is, g(x) = 0 for x # 0), 
which cannot be. So 


(5.53) xg(x — y) =x(g(x) + g(y)) — 2yg(@x) forx,y ER. 


Replacing y by —y in (5.53) and adding the resultant equation to (5.53) and using 
g even, we get 


x(g(x + y)+ g(x — y)) = 2x(g(x) + g(y)) forallx,y eR. 


Hence g satisfies the equation (Q), the continuous solution of which is given by 
(5.51a) (see [463], Result 4.11). 


Remark. [471]. The general solution g : R > R satisfying (5.51) with g one-to-one 
on R, and R_ is given by 
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g(x) = B(x, x), 
where B : R x R > Ris a biadditive map. 


Proof. From the proof in the remark above, we can conclude that g satisfies the 
quadratic equation (Q). Then, from Theorem 4.1, we get that g(x) = B(x, x), where 
B is biadditive. 


Note that here we have obtained the solution of (5.51) without assuming any 
regularity condition on g. 


The following remark will be of use in what follows. 


Remark. Let V be a linear space over R and f : V > R satisfy f(tx) = |t| f(x) 
for x € V, t € R. Then the following statements are equivalent: 


(sl) f(x) = f(y) implies f(ax + by) = f(bx +ay), 
(s2) f(x) = f(y) implies f(@ +1y) = f(tx + y), t #0,1, 
(s3) fe +y) = f@ — y) implies f(x + ty) = fe —ty), t # £1,0, 


forx,y EV, a,b,t ER. 


Proof. (s1) = (s2) is obvious. 
Suppose (s2) holds and f(x + y) = f(« — y). Then, by (s2), 


1-t 
1+t 


2 _ 2 
f(a e+) =F(5e-m). 
which is (s3). 


Suppose (s3) holds and f(x) = f(y). Then 


f(xty+ 


—_ 
w=») =2 (F044 6-») 


or 


1 1 
F(F@+9+6-m) =F G@+y (x —y))-—@ »), 


Using (s3) we have 


1 a—b _ 1 a—b ; 
£(5 (x44 SSo-w)) =6(G (e+ y- Shem): 


that is, 
(ax +by)\ _ (bx + ay) 
a a 


which is (s1). 


If, instead of f, a norm || || and areal n.l.s. V are used, the following are equiva- 
lent: 


(r1) [|x| = lly|l implies [ax + by|| = ||bx + ayll, 
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(r2) |x|] = Ily|| implies ||x + ty|] = |Itx + yl], t £9, 1, 
(3) |lx + yll = [lx — yl| implies |[x + ty|| = |lx —tyll], ¢ A 0,+£1, forx,y € 
V,t,a,beR. 


Now (12) > (14), where 


(4) x,y,2 EV, x tyt+z=9, |lxll = Ilyll > lle —zil = lly — ll. 


Suppose (r2) holds. Let x+-y+z = 0 with ||x || = ||y||. Then ||x+2y|| = ||2x+y]| 
(by r2); that is, || — z + y|| = ||x — z||, which is (14). (74) says a triangle is isosceles 
if, and only if, two medians are equal. Note that the implication is symmetric. That 
is,x —y+y—z+z—x =Oand |ly — z|| = lz — xl] imply [xl = lly. 


Ficken [271] proved the following classical theorem. 


Result 5.13. [271]. In order for ann.l.s. V over R to permit the definition of an i.p., 


it is necessary and sufficient that, whenever ||x || = ||y|| forx, y € V andt € R, we 
have 
(5.54) Ix + tyl] = IItx + yl. 


Now we will prove the functional equation version of Result 5.13. As usual, we 
will relate it to (Q). 


Theorem 5.13a. (Aczél and Dhombres [44], Dhombres [218], Day [214]). Suppose 
V is a real topological space and f : V — Ris a continuous function satisfying 


(i) f(x) = Oif and only if, x =O and f(x) > O forall x #0, 
(ii) f (tx) = |t| f(x) forx € V, t ER, and 
(iii)whenever f (x) = f(y) for x, y € V, we have 


(5.54a) f(x+ty) = f(txt+y) forallt eR. 
Then f* satisfies (5.51), and q(x) = f (x) for x € V is a quadratic functional; that 
is, q satisfies (Q). 
Proof. The proof is achieved by two auxiliary results. 


Proposition (i). [f f(x) = f(y) = flax + d — a)y) for some x, y € V and some 
a € ]0, l[ and f is continuous and satisfies (1), (11), and (iii), then x = y. 


Proof. The proof is based on induction and the liberal use of (5.54a). 
Now f(y) = f(ax + —a)y) = f(a — y) + y) with the use of (5.54a) 
implies 


fla — y) + y} + @ — Iy) = f(a — Ifa — y) + y} + y); 


that is, 


flax) = flal@ —-D@—-y)+y) 


or f(x) = f(A —a)(« — y) — y), which is also = f(y) (use (ii) noting that f is 
even and take t = —1). 
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Repeating the argument above, we get (using (5.54a)) 


fd — a) —y)—y}+@-a)y) = f(2—-—a){U —a)( —y)—y}+y) 


or 


f( — a)x) = f( — a)[2 — a)(x — y) — yD); 
that is, 
f@) = f(2@-a)\a—y)—y)=f(y) forx,yeV. 
Now we invoke induction. Suppose that 
f@)=f(2-aa-—y)-y)=fO) forx,yEeV, ne Z4. 
Then, by (5.54a), 


f(I@—a)a—y)—y]+@+1—a)y)= f(a2+1—a)[m—a)@—y)—y]+y); 


that is, 
f(m@ —a)x) = f(r —a)(n + 1—a)(* — y)—y)). 
Thus, by (ii), f(x) = f((n + 1 — a)(x — y) — y) and induction holds. Hence 
_f@) _-, (« ae —— 
(nt+-l)-—a : (nt+1)-a]’ 


Allowing n tending to infinity and using the fact that f is continuous, it is easy 
to see that f(x — y) = Oorx = y (use (i)). This proves Proposition (i). 


Remark. Note that the proof works as long as a is not a positive integer. 


Proposition (ii). For any x, y € V, x, y #0, and positive real r, there are no more 
than two distinct real values of t such that 


f(x +ty) =r, 


where f satisfies (1), (ii), and (iii). 


Proof. Suppose there exist three distinct ¢; (@ = 1, 2,3) such that 
(5.55) fathy)= fat+hy)=fat+hy) forx,yeV. 


Case (i). Suppose {x, y} is linearly dependent, say x = Ay. Then (5.55) yields (use 
(ii) 
[A+ t|=|A+t| =|A+f3| since f(y) £0. 

A possible case is A + t) = A + f2 implies tf) = f2 and there is nothing to prove. 
Similarly, 2+ tf) = 24+ 3, and2+ f% = A+ imply t) = f or fo = fs and there is 
nothing to prove. 

It could be that 2-+f; = —(A+f2), implying t)+f2 = —24 and A+t; = —(A+4:3), 
giving t) + 3 = —2A or fp = £3. There is nothing to prove. The remaining cases can 
be argued similarly. In this case, 


272 5 Characterization of Inner Product Spaces 


qx+y)+q@—-—yl=frt+yPtfe-y/y 
=[(1+4)?+(1-A) If)" 
= (2+ 247) f(y)? 
= 2f(y) +2f (ayy 
= 2q(x) + 2q(y), 
which is (Q). 


Case (ii). Suppose {x, y} is linearly independent. Then {x + ty, x + f2y} is linearly 
independent. Since linear span {x, y} = linear span {x + fiy,x + hy}, x +hy = 
a(x +tiy)+ fj + ty) for some a, £. Without loss of generality, we may suppose 
that x + f3y lies in the segment |x + tyy,x +fy[. Then a+ 6 = 1 witha € JO, I[, 
orx+fy=a(x+ty)+Ud-—a)a+hy), fora € ]0, 1[. 

Now (5.55) becomes 


f@+thy)=fa+hy) = fl@a@tny)+d-a)@+ny)), 


and, by Proposition (i), x + f1y = x + ty, which is not the case. This proves 
Proposition (ii). 


Proof of Theorem 5.13a. Using (5.54a), forx, y #0, x 4 y, 
Xx = y-x 
Ce) etd a) 


f (re -y a Oa, a) = forty x 


implies 


that is, 


(5.56) fie+ty)fG)F@—y) = FFG» + F@ — yl? — 2F D2] 


for x, y € V (including x = y, x,y = 0). 
Note that (5.56) is “close” to (5.51). In order to get (5.51) precisely, start with 


r( - )=4(4) (x,y #0) 
fa) ~ VFO) - 


and obtain 


x x 
§(1@75 5+ £075) = § (F075 yee IF) 


or 


(5.56a) fae +y)f@SFO) = FF0)x + f@x)’y), 
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which by squaring both sides yields 


a@q(yq@+y)=9q)x+9@)y), 


where q(x) = f (x)*, which is (5.51). 
Changing y to —y in (5.56), we have 


(5.56b) f(x ty) fx) f@ —y) = FI2F@)y + AFG) — f@+y))a] 


or use (5.56) by replacing x by > and y by y + 3 to have 


f+ yf (5) f= F (27 (5) (v+5)+ [7-9 —2f (5) | =) 


which is (5.56a). 
Replacing y by —y or x by —x in (5.56a), we have 


fx —yf@FO) = FF @)’y — f0)’x), 


from which we obtain 


2, 2 
(Sete | = 96) 


FO)F@ — y) 
Xx 
-1 (200555). 


Now (5.54a) can be rewritten as (take x, y 40, x A y) 


(5.57) 


F@)FO) FO) 
_ (oe —fO)Px) f&-y) x ) 


Dee aes 2 
rorya (8 OPE 1 45 2 ) 


2 ; 
fole=)- 2G) ee 


Deke _, 2 
and by (5.57) and (5.54a) = f Ga +IG= ngs) 


that is, 


fx +y) fe — y) f(x) 


(5.56c) d : ; 
= f2FQ)Yy + (F@— yy —2fO))x) forall x,y € V. 


Changing x to —x or y to —y in (5.56c) yields 


FS +y) FO — WF) 


(5.56d) ‘5 ‘i 5 
=f2fa)y y+ 2fO) — fatty) x) forx,y eV. 
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Let us denote 


a= fx—y)y—2f0), 
B=2f0)-fartyy, 
y= f@-y)—2F(@), 
5=2f ay —fae+yy, 
and invoke Proposition (11). 
Comparing (5.56), (5.56b), (5.56c), and (5.56d) and using Proposition (ii), we 


find that among the four numbers a, /, y, 6 there are at most two distinct numbers. 
This leads to the following four cases: 


Case 1. a = 0 implies that g satisfies (Q). 

Case 2. 8 = y implies that ¢ satisfies (Q). 

Case 3. a = B and y = d yields f(x + y)* + f@ —yY =4f 0) = 4F x)’; that 
is, (Q). 

Case 4. a = y and f = 0. Then f(x) = f(y). We have two subcases to deal with. 


Subcase (i). Suppose f(x + y) = f(x —y). Note that {x, y} is linearly independent. 
We deduce from (5.56c) that 


2f(y)? 2f(y)? 
a (oro ( Ay) 


If2f(y)* < f(x — y)’, then, by Proposition (i), x = y, which is a contradiction. 
Suppose f(x — y)? < 2f(y)*. We will show that this is also impossible. 
Set x + y =u and x — y = v. Since {x, y} is linearly independent, so is {x + y, 
x — y} = {u, v}. Starting with uw, v and following the argument above corresponding 
to (5.56c), we get 


fluto)f(u—v) fl) = f2f(u)?o + (fu — v)* — 2f (v)”)u); 


that is, 


f2x)fQy) fe +y) = fl2f@+yP?@ —y) + Fy? —2f@ - yD — y) 


or, since f(x + y) = f(x — y) and f(x) = f(y), we get 


= _ | f@-yy f@-yy 
fe-nasory= | Boone Born] 


Since f(x — y)? < 2f(y)’, by Proposition (i), we have x + y = x — y, whichis a 
contradiction. Hence f (x — y)? = 2f (x)? and (Q) holds. 


Remark. If V is a real n.l.s. and f is replaced by the norm || - ||, the conclusion 
above can be obtained by using the triangle inequality in (5.56c). 
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Finally, we consider the following subcase. 


Subcase (ii). f(x + y) # f(x — y). Then start with vu = x + y, v = x — y andend 
up with cases 1, 2, 3, 4. Now case 4 is impossible since f(u) A f(v). In all other 
cases, we get 


qu + v) + q(u— v) = 2q(u) + 2q(v); 


that is, 
q(2x) + q(2y) = 2q(x + y) + 2q(x — y), 


which is (Q) since g(tx) = f(tx)* = t?q(x). 
Thus q(x) = f (x)? satisfies (Q) in all cases. 


Corollary. Proof of Result 5.13 [271]. By Theorem 5.13a, q(x) = ||x||? satisfies 
(Q); that is, (JVN). Thus V is an i.p.s. 


A considerable weakening of (5.54) is [606] 
(5.54b) dc > 1 such that ||x + cy|| = |ly + cx|l, 


for x, y € Sy, the unit sphere of V. 


5.2 Geometric Characterization 


Now we present an application of Result 5.13 or 5.13a. It is well known that the 
length of the median of a triangle is determined by the lengths of the sides of the 
triangle. This is the geometrical interpretation of (5.58). The triangle with sides 
x,y,x — y has s(x + y) as its median through the origin; this means there exists a 
function ¢ : R3 — R, for which 


(5.58) lx + yll = €CIxI, lly. lx — yl) forx,y eV, 


where V is a real n.l.s. (Aczél and Dhombres [44], Amir [73], Aronszajn [78], Day 
[214], Dhombres [218]). 


Theorem 5.14. [44, 218, 214, 78]. Suppose V is a real n.l.s. such that the lengths of 
the sides of any triangle in V determine the lengths of the medians. Then the norm 
of V comes from an i.p. 


Proof. The geometrical property leads to the functional equation (5.58). We will now 
derive (5.54). The proof is by induction and is different from that in [44, p. 138]. First 
we will show that, for x, y € V with ||x|| = |ly|l, 


(5.59) lax + yl] = llx +nyl| forn € Z. 
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For n = 1, it is trivially true. Using (5.58), we obtain 


2x + yl] =I@+y) +l 
= €(\lx + yl Ill, yD 
= C(x + yl, ly lel) (since |x|] = lly) 
= |x +2yl|l. 


Suppose (5.59) holds for all k < n. Then 


l(a + Ix + yll = lx + y) +4 
= €(|nx + yl, IIx, I — Dx + yl) 
= E(x + ny), yl, I@ — Dy + x11) 
= ||x+ (1+ ly, 


and so (5.59) holds for all positive n. Obviously (5.59) holds for all n € Z. 


For [|x|] = |lyll, |] =|] = ||2]], so that by (5.59) we get 
|» rte |= +y| foralln € Z*. 
n n 
Now we will prove by induction on m, for fixed n and ||x|| = ||y|l, 
(5.59a) |nx + my|| = ||lmx + ny|l, 


Inx + 2y|| = ||(ax + y) + yIll 
= €(|lnx + yll, lly. axl) 
= E(x + ny), Ilxll, lay) 
= ||2x + ny]. 


Suppose (5.59a) holds for all k < m. 


|Inx + (m + 1)y|] = |(2x + my) + yl 
= €(|Inx + my|l, lly], lax + (m — Dy) 
= €(\lmx + ny|l, lll, ln — Dx + ny) 
= ||(m + 1x +ny|). 


Thus (5.59a) is true. Also, it is easy to see that (5.59a) holds for all m,n € Z. 
It follows that || 4x + y|] = |x + 4y|| forn,m (4 0) € Z, x,y € V with |[x|| = 
Ily Il. 

Since ||rx + y|| = |lx +ry|| for all r € Q, and || || is a continuous function, 
(5.54) holds for x, y € V, t € R. Thus V is ani.p.s. 


Theorem 5.15. [44, 218]. Let V be a linear space over reals and q : V > R+ such 
that q(0) = 0, g(x) 4 O forx #0, x € V. Suppose that, for all x # 0 in V, there 
exists a subset I, of R of positive Lebesgue measure Mx such that q(Ax) < My, for 
all 2 € I, (ray bounded). Further, if q is a solution of (Q), then V is an i.p.s. 
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Proof. Since q satisfies (Q) (by Theorem 4.1), there exists a symmetric, biadditive 
B:V x V — R such that 


1 
B(x, y) = slate + y)—a(e)—4O)] forx,y eV. 
To show that B is bilinear, define A : R > R by 
A(t) = B(tx,y) forte R, x,yeV. 


Obviously, A is additive and for t € J, we have 


2A(t) = q(tx + y) — q(tx) — q(y) = —q(tx) — q(y) = —Mx — ¢(y); 


that is, A is bounded below on a set of positive measure. Hence A(t) = A(1)f for 
t € R; that is, B(tx, y) =tB(x, y). So, B defines an i.p. in V. This proves the result. 


Now we prove the following generalizations. 


5.2.1 Some Generalizations 


First we treat the following two functional equations connected with (Q): 


(5.60) gy +x) — gy —x) =A(e +x) —-sy—x)), 
(S.60a) gy +x) + hy — x) =Ak(y +x) + €Q — x)), 
for x, y € V, where V is a linear space over a field F, with characteristic zero, 
2€F,andg,h,k,€:V > F. 
First we study equation (5.60). Let g : R — R be a quadratic functional 


(satisfying (Q)) and B be the associated biadditive B given by (4.1). The property 
B(Ax, y) = AB(x, y) forx, y € R, 2 € R gives 


qUx+y)-—qUx -—y)=Aq@a+y))-a@—-y), 
which is (5.60). 


We now determine the solution of the functional equation (5.60) without assum- 
ing any regularity condition in Theorem 5.16. Even though its solution is obtained 
in [44, 320], here we present in Theorem 5.16 a simpler and direct proof. 


Theorem 5.16. (Aczél and Dhombres [44], Gleason [320], Kannappan [465]). Let 
V be a linear space over a field F of characteristic zero. Suppose g : V > F 
satisfies the functional equation 


(5.60) g4y+x)-—gUy—x) =Agivyt+x)-giy—-x)) forx,yEV,1€ F. 


Then g — g(0) is quadratic (that is, it satisfies (Q)), and there exists a unique sym- 
metric, bilinear B : V x V — F such that g(x) = B(x, x) +c for x € V, where c 
is a constant. 
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Proof. If g is a solution of (5.60), then g+ is constant. So, without loss of generality, 
let us assume g(O) = 0. 

Putting 2 = 0 (or y = 0) in (5.60) gives that g is even; that is, g(x) = g(—x) for 
xe. 


Remark. If V = R = F in the theorem, then (5.60) with y = 1 yields g + x)— 
gA-—x) = Ag +x) -— gd —x)) = Aa(x). The fact that g is even and the 
interchange of 4 and x yield Aa(x) = xa(A); that is, a(x) = cx and g(A +x) — 
g(A — x) = cAx, which with x = / gives g(A) = cA? (using g(0) = 0), where c is a 
constant, or we have 


gtx)—-gd—x) =clx = 5G +5) =G=2), 


from which g(0) = 0 follows g(A) = cd” and we are done. 
To continue the proof, now replace x by wx in (5.60) and use (5.60) and the fact 
that g is even to get 
(5.60a) 
gay + ux) — gy — ux) =Au(gy+x)-—sy—x)), forx,yeV, Awe F. 


Let 2 = wand y = x in (5.60), and use g(0) = 0. We obtain (with u = 2x) 
(5.61) glu) =A7?g(u) forue V, AEF. 


Remark. When V = R = F in the theorem, (5.61) yields g(2) = kA* and we are 
done. 


To continue the proof, replacing x by x + y, putting 2 = 2 into (5.60a), and using 
g even, we get 


(5.62) gGBy +x) =2(g2y+x)—g())+g—v—x) forx,yeV. 


Replace x by 2x and y by y — x in (5.62) separately and use (5.61) to have 


(5.63) g(3y + 2x) = 8g(y +x) — 8g(x) + g(y — 2x) 
and 
(5.64) gy — 2x) = 2gQy — x) —2g(x) + g(y — 2x), 


respectively. First subtract (5.64) from (5.63) and utilize g even and (5.60a) to get 
(5.65) g(2y —x) = g(y +x) +3g(y —x)—3g(x) forx,yeV. 
Second, add (5.63) and (5.64) and use (5.65) twice to obtain 


gGy + 2x) + gGy — 2x) 
(5.66) = 8e(y +x) — 10g(x) + 2g(2y — x) + 2g(y — 2x) 
= 12(g(y +x) + 8(y — x)) — 16g(x) — 6g(y). 
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Finally, from (5.66), replacing x by 3x and y by 2y and using (5.61), (5.66), and g 
even, again we have 
36(g(y +x) + g(y — x)) = 12(g2y + 3x) + g(2y — 3x)) — 144g (x) — 24(y) 
= 12[12(g(y + x) + gy — x)) — 16g) — 68 (%)] 
— 144g(x) — 24g()); 

that is, g satisfies (Q). 

Then, by Theorem 4.1, there is a unique symmetric, biadditive B: V x V > F 
such that g(x) = B(x, x), where 

4B(x,y)=sat+y)—g—y) forx,yeV. 


The use of (5.60) shows that BUAx, y) = AB(x, y) forx,y € V, 2 € F. So B is 
bilinear. This completes the proof of the theorem. 


Remark. Note that the general solution of (5.60) has been obtained without assum- 
ing any regularity conditions on g. 


Finally, we determine the general solution of the most general equation (5.60a) 
connected with the Jordan—von Neumann quadratic functional equation in Theorem 
5.16a. 


Result 5.16a. (Kannappan [465]). The functions g,h,k,€ :V — F satisfy the 
functional equation 


(5.60a) gUy+x)+hOy—-—x)=Akt+x)+lQ—--x)), forx,yeV,A€ F, 
if, and only if, there exists an additive (linear) A and a symmetric, bilinear B such 
that 

g(x) = Bx, x) + AQ) +5, 

k(x) = B(x, x) + AQ) +0, 

h(x) = —B(x, x) + A(x) —b, 

€(x) = —B(x,x)+ AQ) -c, 
where b and c are constants. 


Now we proceed to consider three related functional equations connected to i.p.s. 
and obtain their general solutions, 


(5.67) g(Axt+y) +a(x — Ay) = (1+ A7)(2(x) + 2(y)), 
(5.68a) g(x ty) thx — Ay) = (1+ 27)(e(x) +4()), 
and 

(5.68b) g(Ax ty) +h(x — Ay) = (1+ 27) (x) + €0)), 


where g,h,k,€: V — F, V isa linear space over a field F andd € F. 
First we treat the functional equation (5.67). Even though it was solved in Rassias 
[683], here we present very simple and direct proofs. 
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Theorem 5.17. (Kannappan [464]). Let V be a linear space over a field F of charac- 
teristic zero (or characteristic different from 2). If g : V — F satisfies the functional 
equation (5.67) for allx,y € V and 2 € F, then g satisfies (Q) and there exists a 
symmetric, bilinear B: V x V — F such that g(x) = B(x, x) forallx € V. 


Proof. If 2 = 0 in (5.67), there is nothing to prove. Let A 4 0. 
Set x = 0 and y = 0 separately in (5.67) to have 


g(—Ay) = (1+ A7)g(0) + 27 2(y) = g(Ay) 


(change x to y in the second substitution); that is, g is even. Letting y = O in the 
above, we get g(0) = 0 and 


(5.68) g(Ax) =A°g(x) forx EV, AEF. 


Remark. If V = R = F, the proof is finished. 


Proof (i). With 1 = 1, (5.67) becomes (Q). By Theorem 4.1, there exists a symmet- 
ric, biadditive B : V x V — F such that 


g(x) = Bix,x) forx eV, 


1 
Bix, y)= zls@ +y)—g(x)—-g()] 
(5.69) 


1 
= qls@ +y)—g(—y)] forx,yeV. 


In general, B is biadditive only. Here we show that B is bilinear. 
Interchange x and y in (5.67) and use g even to have 


g(x +y)+ ee — Ay) = gy +x) + gly — Ax) 
or 


BiAx, y) = g(x + y)— g(x — y) 
(5.70) = gy +x) — gy —x) 
= Bix,dAy) forx,yeV, Ae F. 
Further, by (5.69) and (5.70), 


B(x, Ay) = 17 B(x, y) 
(5.71a) = B(I?x, y) 
= B(x,d’y) forx,yeV, AEF. 


Using (5.68), (5.70), (Q), and (5.71a), we show that B is homogeneous in the 
first variable. For x, y € V, 4 € F, we have 
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(+ 1)Pe@+y)+@—1)?8( —y) 
= 2(1 + A?)(g(x) + g(y)) + 24(g@ + y) — 8@ —y)) 
= 2(1 + 2°)(B(x, x) + BOY, y)) + 84B(x, y) 
and 
g(4+)a+y))+a(A4-DG-y)) 
= slg(2x + 2Ay) + g(2Ax + 2y)] 
= 2[g(x + Ay) + gUx+y)] 
= 2[B(x +dAy,x +dy)+ BUxt+y,Ax+y)] 
= 2[(1 + V?){B, x) + B(y, y)} + 2BG@, Ay) + 2B(Ax, y)] 
= 2(1 + 1°)[B(x, x) + B(y, y)] + 8BQx, y). 
Hence, B(Ax, y) = AB(x, y) forx, y € V, A € F. That is, B is bilinear. 
Proof (ii). From (5.69) and (5.68), we have 


B(Ax,x) = zle +1)? —(A-1)7]g(x) =AB(x, x) forx eV, Ae F. 


Replacing x by x + y in the above and using (5.70) and the symmetry of biadditive 
B, we get 
B(x, y) = AB(x, y); 


that is, B is linear in the first component. Hence B is bilinear, and we are done. 
Proof (iii). Replacing x by Ax and y by (A+ 1)y +x in (Q) and using (5.68), we get 
(A+ 1)?g +y)+9(4—- Dx — At Dy) = 2078 (x) + 2¢(A+ Dy +2). 


Changing x to —x in the above and subtracting the resultant from the above and 
using the evenness of g and (5.69), we have 


(A+ 1)°B(x, y) — BA — 1x, A+ Dy) = 2B(x, A+ Dy), 


which by the use of (5.70) and the biadditivity of B yield B(x, y) = AB(x, y). This 
completes the proof of this theorem. 


Before considering (5.67) on a field F of char F = 2, we study the remaining 
two equations. 
Now we consider equation (5.68a) and prove the following theorem. 


Theorem 5.17a. (Kannappan [464]). Suppose g,h : V — F, where F is a field of 
characteristic 0 (or different from 2), satisfy equation (5.68a). Then there is a unique 
symmetric bilinear B: V x V — F such that 


g(x) = Bx, x) +b, h(x) = BO, x)-), 


where b is a constant. 
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Proof. Putting 2 = 0 in (5.68a), we get g(x) — h(x) = c. This g(x) in (5.68a) yields 
(5.67). Hence we get 


h(x) + 5 = B(x,x) = g(x) - 7 


This completes the proof of this theorem. 


Finally, we prove the following theorem regarding (5.68b). 


Theorem 5.17b. [464]. The general solutions g,h,k,€:V — F, where F is a field 
of characteristic zero (or different from 2), of (5.68b) are given by 


g(x) = Ba, x) +b, h(x) = B(x, x)-), 
k(x) =B(a,x)t+tec, €(x)= Bix,x)-c, 


where B: V x V — F is asymmetric bilinear form and b and c are constants. 


Proof. Put 2 = 0 in (5.68b) to get €(y) — g(y) = c = h(x) — k(x). Substituting 
these values of f in terms of g and k in terms of h in (5.68b), we obtain (5.68a). The 
result sought follows from Theorem 5. 17a. 


Remark. Note that no regularity condition is used in all these theorems. 
We now consider the case where char F = 2. 


Theorem 5.17c. (Rassias [683], Dhombres [220]). Let V be a vector space over a 
perfect field F of characteristic 2 (other than Fy) and g : V — F satisfy (5.67). 
Then g is a solution of (Q) and B(x, y) = g(x + y) — g(x) — gQ)) is bilinear. 


Proof. Now (5.67) becomes 
(5.681) g(x +y) +e t+dy) = (1 +27)(8() + 86)). 


Now g(x+y)+8@—-—y)=s+y)+8@+y) =0= 2(g(x) + g(y)), and so g 
satisfies (Q). 

When g is a solution of (5.67), so is g — g(0). So, without loss of generality, 
assume g(0) = 0. 

Now x = 0 or y = 0 in (5.68}) yields (5.68), g(Ax) = A7g(x), for 1 € F, 
xeV. 
Define B: V x V > F by 


Bix, y)=g(e+y)—g()—g(y) forx,y eV. 


Claim B is bilinear. First we will show that B is homogeneous in each variable. Note 
that B is symmetric, and since char F = 2, —g(x) = g(x). 

It is easy to check that B(Ax, y) = B(x, Ay) and 4*B(x, y) = B(x, Ay) = 
B(A*x, y) for A € F, x,y € V. Since F is a perfect field of char F = 2, for every 
u € F there isa A € V such that w = 4*. Hence 


B(ux,y)=muB(x,y) forwe F, x,yeV. 


5.2 Geometric Characterization 283 


So, B is homogeneous in the first (and thus in the second) variable. Now we prove 
the biadditivity of B. 

We observe that B from its definition as a Cauchy difference satisfies the cocycle 
equation 


Bix, y)+ Bix+y,z)=BO,z)+Bw+z,x) forx,y,zEV. 
Choose 2 4 0, 1. Replace x, y by Ax, Ay and y by Ay separately to get 
1° B(x, y) + AB(x +-y,z) = B(Ay, z) + AB(Ay +z, x), 
AB(x, y) + B(x + Ay, 2) = ABY,z) + BUY +z, x), 
which by subtraction or addition of 4 times the second equation results in 
Bix +Ay,z) = Bix +y,z)+C0—-A)BQ,z) forx,y,ze V. 
Replacing x by 4x in the equation above and using it gives 


AB(x + y,z) = BUx+y,z) + —A)BO,2z) 
=(Bat+y,z)+ (0 — A)Bix,z)] + d —A)BO,2z) 
or 
dd —A)[B@ + y,z) — BQ, z) — BO, z)] = 9; 


that is, since 2 4 1, B is additive in the first variable. Hence B is biadditive, and 
this proves the result. 


5.2.2 Solution of an Equation Related to Ptolemaic Inequality 
In Alsina and Garcia-Roig [70], the authors, motivated by the Ptolemaic inequality 


lx — yl zl S Mle — cil yl + te — yi Med, 


forx, y,z € V, ann.Ls., studied the equation 


x y 
(5.71) fa-y=f@)fO)F (= = =a). for x,y (40) ER, 


in connection with i-p.s. 
Here we obtain the solution of (5.71) in the following result under slightly differ- 


ent conditions and then under the same conditions as in [70] by presenting a simple 
and direct proof. 


Theorem 5.18. (Alsina and Garcia-Roig [70], Kannappan [471]). Let f : RR > Ry 
be strictly increasing on R4 even and satisfy 


2x 
f@y 


and the functional equation (5.71). Then f(x) = c|x| for x € Rand c a positive 
constant. 


(5.72) ff ( ) =2, forx £0, 
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Proof. First y = x in (5.71) gives f(0) = f(0) f(x)? and f(0) = f(0)*. Since f 
is strictly increasing, f is 1-1 on Ry, f(0) = 0, and f(x) 4 0 for x ¥ 0. With 
y = —x, (5.71) using (5.72) and f even gives 


(5.73) f 2x) = f(x) ( =) =2f(x) forx ER. 


Set y = —3x in (5.71) and utilize (5.72), (5.73), and f even to get 


fx) Bx) fF ( = f(4x) =4f(x) 


3x ) 
f(x ad 7 Gey 


=2f (x) fx)f (= ne —— 


(2 by (Gx) f (=): that is, 


x 3x 12x 
aap Fae) = (Fer) 
since f(x) 4 0 for x 4 0. Then, since f is 1-1 on R™, either 
x 3x 12x 
FOP” FON? FEN 
which cannot be since f(x) > 0 for x 4 0, or 
x 3x 12x 
FOP” FOP FOP’ 
that is, since f(x) > 0 forx 4 0, 
(5.74) f@Gx)=3f(«) forx ER. 
From (5.73) and (5.74), we obtain for all integers m,n 
FO SE, 


The set {2”3” : n,m € Z} is dense in R™, and since f is strictly increasing (this 
is the only place we need this), f(x) = cx for x € R, and positive c. From f even, 
we obtain the solution sought. This completes the proof of this theorem. 


Remark. Suppose f (x) f (“> 
rem 5.18. Then 


X 
fx) = f@x —x) = fx)fa)f (=- ak 


—) = | for x 4 0 holds instead of (5.73) in Theo- 


that is, since f(x) # 0 for x 40, 


Xx 
FOx)F (Ga5- jan) 7 = Coal 


from which it follows that f(2x) = 2 f(x). Now (5.72) holds and we obtain the 
solution f(x) = c|x| as in [70]. 
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5.2.3 Orthogonal Additivity and I.P.S. 


There are several lines of approach for the characterization of i.p.s. among n.l.s. 
We have seen quite a few so far. We now turn our attention to geometry—in par- 
ticular, to dimension and orthogonality. A number of definitions of orthogonality in 
vector spaces, in addition to the usual one for i.p.s., are known in the literature. Four 
were mentioned earlier, and we now introduce two more. 


Definition. [693, 118, 692]. Orthogonality space. Let X be a real vector space of 
dimension > 2 and be a binary relation on X. Then X (L) is called an orthogonal- 
ity space provided possesses the following properties: 


(pl). x LO, O Lx forall x € X. 

(p2). Ifx,y € X, x, y #0, and x 1 y, then x and y are linearly independent. 

(p3). Ifx, y € X andx L y, thensx 1 ty foralls,t ER. 

(p4). If Y is a two-dimensional subspace of X and x € Y, A > O, then there is a 
y € Y such that x L y and (x+y) L (x -y). 


5.2.4 Diminnie Orthogonality 


Let X be a real vector space with Hamel dimension dim X > 2, X ) stand for the 
topological dual space of X, and F = {@ € X : ||A|| < 1}. For fixed x, y € X, 
define 


D(g, yw; x,y) = ox) wy) - dQ) v@) ford,yeF 


and 
|x, yl| = sup{D@, wix,y):¢,we F} forallx,ye xX. 


Definition. For x, y € X, x is said to be Diminnie orthogonal to y, in symbols 
x by, if 


lle, yl] = [aT My 


Several authors studied functional congruences applied to orthogonal pairs of 
independent variables, with orthogonality defined in some sense. We will see a cou- 
ple of results in this direction. The congruences fall under conditional equations. 


Definition. Let X (L) be an orthogonality space and G be an Abelian group. Then a 
mapping f : X — G is called orthogonally additive with respect to _L if 


(A’) faty)=f@)+f) forallx,y ¢ X withx Ly. 


Result 5.19. (Ratz [693]). Let X (L) be an orthogonality space and G be an Abelian 
group. Then f : X — G is an odd solution of (A’) if, and only if, f is a solution of 
(A). 
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Theorem 5.19a. (Ratz [694, 692]). Suppose X (1) is an orthogonality space, G is 
an Abelian group, and q : X — G is aneven solution of (A‘). Then q is quadratic; 
that is, q satisfies (Q). 


Proof. We will accomplish the proof in four stages by liberally using (p1) to (p4) 
and, in particular, (p3). 


Step 1. Forx, y ¢ X, («+ y) L @ — y) implies that g(x) = q(y). 


as) =4 (24) 
=0()+0() by (p3) and (A’) 
=«()+ (>) (q even) 
=((+*) wes 


=q(y). 


Step 2. g(2x) = 4q(x) for x € X. Now use (p4) and (p3). For x € X, thereisay € 
Y (atwo-dimensional subspace of X containing x) withx Ly, ~+y)l(@-—~y). 


qQx)=qa+y+x-—y) 
=qat+y)+qa—y) by (A’) 
= q(x) +q(y) + 4x) + 4(-y) _ by (A’) and (p3) 
= 2q(x) + 2q(y) 
= 4q(x) by Step 1. 


Step 3. Forx ¢ X¥, 2 ER, q(x + 4x) + q(x — Ax) = 2g (x) + 2g (x). 
If 2 = 0, there is nothing to prove. Let 2 4 0. By (p4), there exists a y € X with 
x Lyand (x+y) L (x — y), where 2 > 0. 


q(x + Ax) + q(x — Ax) 
=q(x+yt+ Ax —y)+q(x — Ax) 


= q(x +y)+qQAx—y)+4q@ — Ax) by (A’) 

= q(x)+40)+ 40x) +q(-y) +4 — Ax) by (A’) 

= q(x) + qx) + q(x — Ax) + 4q(y) — 24(y) 

= q(x)+ q(Ax) + q(x — Ax +2y) — 240) by Step 2 and (A’) 
= q(x) +qUx)+q@t+y)+q(-Ax + y) —2q(y) 

= 2q(x) + 2q(Ax). 


If A < 0, consider —/, and then the Step 3 relation holds. 
Step 4. Forx € X, r,s € R, q(x +sx)+q(rx — sx) = 2g (rx) + 2q (sx). 
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If either r = O or s = 0, it is trivially true. Letr,s A 0. 
RY RY 
g(rx +sx)+q(rx —sx) = q (rx + (rx) +4q ((rx) _ (rx) 


= 2q(rx) + 2q (= . rx) 
r 
= 2q(rx) + 2q (sx). 
To prove that q satisfies (Q), again we are going to use (p3) and (p4). 
Let x, y € X. If either x = 0 or y = 0, (Q) holds since g(0) = 0 and q is even. 
Letx,y £0. 
Let Y be a two-dimensional subspace of X containing x and y. From (p4), we 


see that there is az € Y such that x Lz, (vw +z) L (« —z). Since {x, z} is linearly 
independent, span {x, z} = Y. So, y = rx + sz for some r,s € R. Now 


qa t+y)+q(x —y)=q& +rx +sz)+q(e —rx — sy) 
=q((l +r)x) + 4(sz) 


+ q((1 —r)x) + q(-sz) by (A’) 
= 2q(x) + 2q(rx) + 2q(sz) by Step 3 
= 2q(x) + 2q(rx + sz) by (A’) and (p3) 


= 2q (x) + 2q(y); 
that is, g is a solution of (Q). This concludes the proof. 


Corollary 5.19b. [693]. Let X(L) be an i.p.s. and G be an Abelian group. Then 
f : X = Gis aneven solution of (A‘) if and only if there exists an additive mapping 
A:R-— G such that f (x) = A(||x||*) for x € X. 


Corollary 5.19c. Let X(1) be an i.p.s. and G be a uniquely 2-divisible Abelian 
group. Then f : X — G is a solution of (A’) if and only if there is an additive map 
A:R— Gandh: X => G such that 


f(x) =A((IxlI?) h(x) forx eX. 


Corollary 5.19d. Let X (1) be an i.p.s. and G a real Hausdorff topological vector 
space, and f : X — G is acontinuous solution of (A’). Then there is a continuous 
function h : X — Ganda constant c € G such that f(x) = cllx ||? + h(x) for 
xex., 


Now we determine all orthogonally additive functions with respect to a general 
orthogonality. Let V be a real i.p.s. with ip. (, ) and orthogonality L (x L y if 
(x, y) =0). A mapping f : V > R satisfying 


(A”) fax+y)=fa)+f), forx,y eV, withx 1 y, 


is called an orthogonally additive function on V (or f satisfies a conditional Cauchy 
equation with condition x | y). We determine all continuous solutions of (A”) in 
the following theorem. 
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Theorem 5.20. (Aczél and Dhombres [44]). Let V be a real i.p.s. of dimension > 2. 
A continuous function f : V — R is orthogonally additive (that is, it satisfies (A’)) 
if, and only if, there exists a continuous linear additive homogeneous A: V > R, a 
continuous bilinear B : V x V — R, anda constant c € R such that 


(5.75) f(x) = BQ, x) + A(x) = cllx|? + A(x), forx € V. 


Proof. It is easy to verify that f given by (5.75) is a solution of (A”). 
Conversely, let f be a continuous solution of (A”). Every function f : V > R 
can be written as 


f(x) = fe(x) + fox) forx € V, 


where fe(x) = LO) FCs) is called the even part of f and fo(x) = f@) fe") is 
called the odd part of f. Note that f. and f, are continuous and satisfy (A’””). We will 
show that fo is linear and fe satisfies (Q). We make use of the fact that in each 
two-dimensional subspace H there exist nonzero x, y in H such that x L y and 
(x + y) L ( — y). Note that x L y implies ax L fy fora, Bb ER. 


(i) To prove that f, is linear (that is, homogeneous and additive on V), it is enough 
to show that f, is linear on any two-dimensional subspace H of V. Now there 
exist x, y (4 0) € A such that x L y and (x+y) L (x — y). First, we show 
that fo(rx) =r f(x) and fo(ry) =rfo(y) forr € R. 


fo(2x) = fox +y+x—y) 


= fox +y)+ fo& —y) by (A”) 
= folx) + folv) + fox) + fo(—y) again by (A”) 
= 2 fo(x) since fy is odd. 


Similarly, fo(2y) = 2 fo(y). 
By induction, we show that f,(nx) = nfo(x) and f,(ny) = nfo(y), n € Z4. 
Suppose this result holds for allk, 0 < k <n. Then 


fo(nx+(n — 2)y) 
= fol —-I)a+y)+@— y)) 


= fol — 1I)@ + y)) + fol — y) by (A”) 
= fol — 1)x) + fol — ly) 
+ fox) + fo(—y) by (A”) 


= (n — 1)L fo(x) + fo(y)] + fo(x) — fo(y) by induction 
nfo(x) + (n — 2) foly) 
= fo(nx) + fo((n — 2)y) since nx L (n — 2)y. 


Hence f,(nx) = nfo(x), and similarly f,(ny) = nfo(y). Since f, is odd, this 
holds for all n € Z. 
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When x L y, so is * al ~ for all n (4 0). Thus 


& x x 1 
fo (n ; ~) =nfo (-) or fo (-) = — f(x). 
n n n n 
Hence, for any rational ae 
m m m m 
fo(Tx) = Tho) and fo (Ty) = Fo). 
n n n n 
Now invoke continuity of f, to obtain 


fo(rx) =rfo(x) and fo(ry)=rfo(Wy) forr €R, 
from which follows fo(rx + sy) =rfo(x) + sfo(y) forr,s € Rsincerx L sy. 
Let u,v € H. Then there exist a, 8, y,d € Rsuch thatu = ax + By, v= 
yx + dy. Let A € R. Then 
fol2u +0) = fo((aa + y)x + OB + 0)y) 
= (la +7) fol) + UB +d) folv) since (la + y)x L GB + d)y 
= Aafolx) + Bfolv) + 7 fol) + dfoly) 
= Afolax + By) + folyx + dy) 
= Af, (u) + fo(v). 


Thus, f, is linear on H and therefore on V. Take f, = A. 
(ii) To prove that fe satisfies (Q) on H, use the same technique as in (i). As before, 


fe(2x) = felx +y +x —y) 


= felx ty) + fe(x — y) use (A”) 
= fe(x) + fely) + fe(x) + fe(—y) again use (A”) 
= 2 fe(x) +2 fel(y) since fe is even. 


Similarly, fe(2y) = 2fe(x) + 2fe(y). Hence fe(2x) = fe(2y) and then 
fe(x) = fe(y) by applying them to 5 1 5. So, fe(2x) = 4fe(x) = 2? fe(x). 
Use induction to show that f.(nx) = fe(ny) = nf (x) forn € Z4. 
Assume that this relation holds for all k, 0 < k <n. Then, as before, 
fe(nx+(n — 2)y) 

= fe((n—1)(e+y)+ & —y)) 

= fe((n—1)(e+y))+ felx —y) apply (A”) 

=(- I'L) + £0) 


+ fe(x) + fe(-y) apply induction and (A”) 
= [2(n — 1)? + 2] fe(x) since fe is even 
= 2(n” — 2n + 2) fe(x) 
= fe(nx) + fe((n — 2)y) apply (A”) 


= fe(nx) + (n — 2)" fe(x). 
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Hence fe(nx) = [2(n? — 2n + 2) — (n — 2)*] fe(x) = n? fe(x). Similarly, 
fe(ny) = n? fe(x) = fe(nx). Since fe is even, fe(nx) = fe(ny) = n? f (x) for 
alln € Z. Applying this to + 1 ~, we get 


fe(=) = (2) = Sie) 
and then 
te (=x) = (“)" Fe) for all — €Q. 


Invoking the continuity of fe, we have 


fe(rx) = fe(ry) = r fo(x) forallr € R, 


from which follows 


fe(rx + sy) = fe(rx) + fe(sy) = r f(x) + s” fo(y) forr,s ER. 


Letu,v € H.Thenthere area, f, y, 6 € Rsuchthatu = ax+fy, v = yx+dy, 
and 


fe(u +0) + felu — v) 
= fe((a+y)x + (B+)y) + fea — y)x + (B - dy) 
= (a+ yp) fe(x) + (B +5)’ fe(y) + a — y) felx) 

+ (B-6)’fely) since (at y)x L (B+0)y 
=[(at+y)+(@a—y)]fe(x) + 1B +6) + B45) 1 fe) 
= 2(a° + y*) folx) + 2(B? + 5°) fe(y) 
= 2[felax) + fe(By) + felyx) + fe(dy)] 
= 2fe(u) + 2 fe(v); 


that is, f2 is a solution of (Q) on H and therefore on V. Then, by Theorem 4.1, 
there is a biadditive B : V x V — R such that fe(u) = B(u,u) foru € V. 
By the continuity of f., B is bilinear (that is, ani.p. on V). 

Indeed, forr € Q, rB(u,v) = B(ru,v) = sl fe(ru +0) — fe(ru) — fe(v)]. 
So, if 2 € Randr —> J, then AB(u, v) = B(Au, v). 

Since V is ani.p.s., if x L y, since fe is orthogonally additive, B(x, y) = 
5(fe(x + y) — fe(x) — fe(y)) = 0. Let u € V and S be a two-dimensional 
subspace containing u with {e;, e2} as its orthonormal basis. Then (e; + e2) L 
(2124), 
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fe(e1) = te (< —_ + a =) 


7 (2 =*) A (4 =") by (A”) 

-n(2)n(2)r0(2)+e(B) way 
1 

= al feler) + fe(e2)] 


since fe is even. Similarly, 


1 
fe(e2) = al feler) + fe(e2)] = fele1)=c (say). 
Let u = ae; + feo fora, BER. 
fe(u) = Blu, u) 
B(ae, + Ber, ae; + fer) 
a B(eq, ey) + B°Bler, e2) since B(e,,e2) =0 
a’ fe(e1) + B fe(er) 
c(a? + B?) 


2 
cllull”. 


This proves (5.75) and concludes the proof of this theorem. 


Corollary. [f f in Theorem 5.20 is an odd solution of (A"), then f is additive. If f 
is an even solution of (A"), then f is quadratic; that is, a solution of (Q). 


Now we consider & , Birkhoff-James orthogonality. 


Result 5.21. [218]. Let V be a real n.l.s. of dimension at least 2. Let f : V > R 
satisfy 


(A) faty=f@)+fO) forx,y €V withx By 


(that is, orthogonally additive with respect to 18). Then the odd part fo is additive 
and the even part fe is zero. The norm of V comes from an i.p. if, and only if, (A“”) 
has a nonadditive solution. 


Diminnie Orthogonality 


There are cases in which characterizations of i.p.s. among n.1.s. underlie a dimension 
condition. A certain uniqueness property of the Diminnie orthogonality works only 
for dimension at least three but does collapse for dimension two. The following 
result in [702] shows that if orthogonal additive maps instead of the orthogonality 
relation itself are used, the two-dimensional case is no longer exceptional. 
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Result 5.22. A real n.l.s. X with dim X > 3 is ani.p.s. <8 is symmetric 12 is 
right unique. 

Let V be areal n.l.s. with dim V > 2, 12 the Diminnie orthogonality in V, and 
G an Abelian group. A mapping f : V > G is said to be orthogonally additive with 
respect to 2 (or satisfy a conditional Cauchy functional equation) provided 


(Al) f(x +y) — f(x) + FQ) forx,y € V with x by. 


Result 5.23. (de Rham [702]). Suppose f : V > G satisfies (A””). Then the odd 
solutions of (A”") are precisely the additive mappings from V — G, every even 
solution is a solution of (Q), every solution of (A) is additive = 0 is the only even 
solution of (A””"), and V is an i.p.s. <> not every solution of the equation (A) with 
G = Ris additive. 


Remark. While the relations 8 and 2 are not sufficient for a two-dimensional real 
n.l.s. to be an i.p.s., the condition concerning (A’”) is. 

Some More Generalizations 

We consider two generalizations of (5.25) and (5.25a). 


Theorem 5.7d. Let V be a real linear space and ¢,a, y : V — R. Then these 
functions satisfy the functional equation 


n 


(5.254) Dd Gi —¥) = Dl ai) + v@) 
i=l 


i=1 
n 
for X1,X2,-.-5Xn €V, xX = ees and fixed n > 3 if, and only if, 


d(x) = B(x, x) + A(x) +, 
(5.26a) a(x) = B(x,x) + A(x) +c, 
w(x) = —n(BQx, x) + A(x) +1 — 0), 


for x € V, where A, A, satisfy (A), B is a symmetric, biadditive function, and c, c, 
are constants. 


Proof. Setting xj = x2 = ---X, = x in (5.25d) yields 

(5.27a) w(x) = —na(x)+nc, where d(0) =c. 

As in Result 5.7a, we consider two cases according to whether n is even or odd. 
First let n be even, say 2k. Let x) = +++ = xp =X, Xep1 = +++ = X2R = YIN 

(5.25d) to have 


kg (=) Likp (=) =OLeGyey (=) 
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or 
(5.29d) d(x —y) + ¢(y —x) =a(2x)+a(2y)—2a(x+y)+2c forx,yeV. 


When n is odd, say 2k + 1, putx, = +--+ = xR = X, Xe =e = XR = 


y, Xe¢1 = 4, ¥ = * in (5.25d) to get 


wo ( ) +e (2 *) +00 


= kas) + kay) +0 (S*) + wv (2) 


Xx 


wilt w 


=ka(x)+ka(y) +a ( 


= 2k+ 1a (4 +"), 


*) + (2k + 1)6(0) 


which is the same as (5.29d). 
So, we solve (5.29d), which can be rewritten as 


(5.76) a(u +0) +.a(u — v) = 2a(u) + 28(0) 
with x + y =u, x —y =p, and 
(5.77) 2B(x) = o(x) + O(—x) — 2c, forx eV. 
Note that (5.76) can be written as (putting u = 0 in (5.76)) 

a(u +o) +a(u —v) = 2a(u) + a(v) + a(—v) — 


which is (4.22), and, by Theorem 4.22, there is an additive A and a symmetric, 
biadditive B such that 


a(u) = Blu,u)+A(u)+c, forue V, 
2B (x) = a(x) + a(—x) — 2c = Ox) + P(—x) — 2c = 2B(x, x), 


and thus 
w(x) = —n[B(x,x) + A(x) +c, -— Cc]. 


It remains to determine ¢. Putting these values of a and y in (5.25d), we obtain 


Y 9s —*)= x B(xi, xi) + A@i) + c1] + ne — n[B, x) + AG) +1] 
i=1 


2 aia 
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that is, 


> [¢@i -—%) — Bei — Xx) —X] = 1c 


i=1 


or 
n 


pac? —X)=nce, 


i=1 
where t (x) = $(x) — B(x, x) for x € V and t(0) = c; that is, 


n 


Dy t(yj) = Nc, 


i=1 


n 
where yj = xj —X, i = 1 ton with >) y; = 0 (see Theorem 1.76). 


Puty; =x, y= y, y3 = —(X +9), y4 =0=--- = yn to have 
t(x) + t(y) +t(—x — y) =3c (which is (A)). 
Set y = 0 to get f(x) + t(—x) = 2c or t(—x) = —t(x) + 2c, and then 
t(x) +t(y) =t@+y)+c_ (which is (A)). 
So, there is an additive A; such that 
t(x) = Ai(Qx) +c = f(x) — BO, x). 


Hence 
$(x) = Bx, x) + Ale) +e. 


This proves (5.26a) and the result. 


6 
Stability 


In this chapter, the stability problem is treated in general. In particular, stability of the 
multiplicative function, logarithmic function, trigonometric functions, vector-valued 
functions, alternative Cauchy equation, polynomial equation, and quadratic equation 
is treated. 

S.M. Ulam, in his famous lecture in 1940 to the Mathematics Club of the Uni- 
versity of Wisconsin, presented a number of unsolved problems. This is the starting 
point of the theory of the stability of functional equations. One of the questions led to 
a new line of investigation, nowadays known as the stability problems. Ulam [807] 
discusses: 


... the notion of stability of mathematical theorems considered from a rather 
general point of view: When is it true that by changing “a little” the hypoth- 
esis of a theorem one can still assert that the thesis of the theorem remains 
true or “approximately” true? ... 

For very general functional equations one can ask the following question. 
When is it true that the solution of an equation differing slightly from a 
given one, must of necessity be close to the solution of the given equation? 
Similarly, if we replace a given functional equation by a functional inequal- 
ity, when can one assert that the solutions of the inequality lie near to the 
solutions of the strict equation? 


Suppose G is a group, H(d) is a metric group, and f : G > H. For anye > 0, 
does there exist a 6 > 0 such that 


d(f(xy), FO)F(y)) < 46 
holds for all x, y € G and implies there is a homomorphism M : G — H such that 
d(f(x), M(x)) <e forallx € G? 


If the answer is affirmative, then we say that the Cauchy equation (M) is stable. These 
kinds of questions form the basics of stability theory, and D.H. Hyers [382] obtained 
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the first important result in this field. Many examples of this have been solved and 
many variations have been studied since. Several investigations followed, and almost 
all functional equations are “stabilized”. It is impossible to give a summary of all the 
results known in this growing area. This chapter is designed to provide an introduc- 
tion to the formulation and solution of such problems. At the same time, we present 
different methods of solving problems of this type. We present as many interesting 
results as possible. Good sources are the book by Hyers, Isac, and Rassias [385] and 
the survey articles by Székelyhidi [798] and Ger [310]. 


6.1 Stability of the Additive Equation 


We now present Hyers, Isac, and Rassias’s [385] valuable contribution to Ulam’s 
problem in the following theorem. 


Theorem 6.1. [385]. Let X and Y be Banach spaces (X could be n.l.s.) and f : 
X — Y be such that 


(6.1) If@+y)- f@)—- FO) <4 


for some 6 > Oand for all x, y € X. Then the limit 
(6.2) A(x) = lim 2-" f(2”"x) 
n> oo 
exists for each x in X, and A: X — Y is the unique additive function such that 


(6.3) If@) — AG) < 6 


for any x € X. Moreover, if f (tx) is continuous in t (€ R) for each fixed x € X 
(that is, if for each x € X the functiont — f (tx) from R to Y is continuous for 
each fixed x), then A is linear. Moreover, if f is continuous at a point in X, then A 
is continuous everywhere in X. 


Proof. For any x € X, the inequality 


(6.4) (5) re9- 10 


(3) 


(6.5) 2" f2"x) — f@)| = A -2™)6. 


holds. Now, by induction, 


Indeed, replace x by 2x in (6.4) to have 


[5722 = fen) = 53 
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that is, 
| sfx) —2f(@)— fQx)+2fG@)| s ° 
or 
lS (2?x) — 2f(x)] - If 2x) -2f@)I < 0 
meaning 


1 
sfx) — f(x) 


a FO") ~ £0) 


5 Bali 1 
< —_ — 
2 22)? 
from which follows 


1 1 1 1 
<0 BY oa FO ae = eo) > 
which is (6.5). 


Next we show that the sequence ea f @"x)| is Cauchy for each x € X. Choose 
m >n. Then 


1 
mm 


1 1 
by (65) <5 a(1 - =) 


1 
2m =n 


FQ" . 2"x) _ f (2"x) 


af 2") = a fo"n| = 


A 


Thus the sequence is Cauchy and, since Y is a Banach space (complete), there exists 
a limit function A : X — Y such that (6.2) holds. 

The next step is to show that A is additive. Replace x and y in (6.1) by 2”x and 
2”y to get 
1 1 
—— <— > 
Qn 


1 1 
| sf 2"& +9) — = F@"x) - =; 


fQ"y) 


forn € Zi, x,y € X. Letting n — ov, we observe that A is additive from the 
inequality above, and from (6.2) we obtain (6.3). 

Our next goal is to show that A is unique. Suppose A; : X — Y is another 
additive function satisfying the inequality (6.3). Then, for x € X, 


1 
IAG) — Ai) = —A@x) — fx) — Ailex) + Fx) 
26 . 
<— (using (6.3)), 
n 
showing thereby that Aj = A. 


Finally, suppose f is continuous at y € X. Let {x,} in X be such that x, — 0 as 
n —> oo. Then, for any integer m € Z7., 
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|AG@n + y) — ADI = A Gal 


1 
= —[llAGnxn + y) — fmxn + y) 


+ fxn +y)- f0)+ FO) -ADDII 
2o+e 


m 


(by (6.3) and continuity of f aty) < 


for sufficiently large n and any integer m; that is, A is also continuous at y. 
For a fixed x € X, if f(tx) is continuous in f, then it follows that A(fx) is 
continuous in ¢ and hence A is linear. This proves the theorem. 


Several remarks are in order. If f in the theorem is continuous at a point in X, 
then the corresponding A is continuous everywhere in X. This pioneering result of 
D.H. Hyers can be expressed in the following way. The additive Cauchy functional 
equation 


fa+y)=fa)+fQ) 


is stable for any pair of Banach spaces. The function 


Gyr faty)—fa)—-fO) 


is called the Cauchy difference of the function (see Chapter 13). Functions with 
a bounded Cauchy difference are called approximately additive mappings. The 


fQ2"x) 
Qn 


This theorem is an immediate consequence of the following result of Polya and 
Szeg6 [670] with Y = R. 


sequence { | is called the Hyers-Ulam sequence. 
Theorem 6.2. For any real sequence (ay) satisfying the inequality 
(6.1a) l€ntm — An —Am| <1, neme Zi. 


there exists a finite limit 


and 
la, —nA| <1, ne Zi. 


Theorem 6.1 is an immediate consequence of Theorem 6.2 (in the case where 
(Y, lll) = QR, |-). Actually, arbitrarily fix an.x € X and put ay, = (4) f(nx), ne 
ns Then (a,,) satisfies (6.1a) because of (6.1), whence, on setting 


aw, (9\_ DY e. (FRY 
A(x) = im (2) = (2) sim, ( k ). 


ro) — nA(e) 


we get 


<1 


6.1 Stability of the Additive Equation 299 


for all n € Z% and all x € X or 
5 x a 
Fe) —néA (=)| <6. 
n 
Since, in view of (6.1), for every x, y € X one has 


a 
n 


J5A(x + y) — JA(x) — 6A(y)| = Tim, 


ar) 
< lim -—-=0, 


n>oon 


consequently 
f(x) -— AQ@)| <6 forallx eX. 


There are two possible ways to generalize Theorem 6.1. One of the natural ways 
is to generalize the domain; the other is to have different bounds on the right side of 
(6.1). 

It is possible to prove a stability result similar to Hyers for functions that are 
not necessarily bounded Cauchy differences. Mainly using the technique of Hyers, 
Th.M. Rassias [678] gave the following generalization of Hyers’s result. 


Result 6.3. [678, 682]. If f : R > R is a real map satisfying 


(6.6) If@ +y¥) — Ff) — FO) s OUI? + lyl?) 


for some 6 > 0, p € [0, 1), and for all x, y € R, then there exists a unique additive 
function A: R = R such that 


If(~) —A@)| 


yap 


forallx €R. 
Remark. p = 0 in Result 6.3 yields Theorem 6.1. 


Remark 6.4. Result 6.3 is true for all real p except p = 1. Gajda in [302] proved 
the following result. Let X be a normed space and Y a Banach space. Assume that 
p > 1is areal number, and let f : X — Y be a mapping for which there exists an 
é > O such that 


If@ +y)— F@)— FODI S eCilxll? + lly ll?) 
holds for all x, y in X. Then the limit 


A(x) = im 2” f(2-"x) 
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exists for all x in X and A : X — Y is the unique additive mapping that satisfies 


If@)- AG) S 


€ 
2? —2 
for all x in X. 

That Result 6.3 does not hold for p = 1 is shown in [302] by the following 
counterexample. In [296], the following example of a bounded continuous function 
f :R- Rsatisfying 


(6.6a) ferry) =7@)—F7 Oe lel lyl fora, ye K, 
with 

im £ — 

im = CO 

x>0 XxX 


behaving badly near 0, is constructed. For a fixed 6 > 0, let the function f : R— R 
be defined by 


fx) = D292"), x ER, 


n=0 
where the function ¢ : R > R is given by 
79 ifl<x<o, 


b(x)=4e0x if -1<x <1, 


and f satisfies the inequality 


(6.6b) If + y) — F(x) — FO) s A(x] + ly) 


for allx, y € R. But there is no constant 6 € R+ and no additive function A: R > R 
satisfying the inequality 
If (x) — A@)| < olx| 


forallx eR. 
Rassias and Semrl [682] also constructed a continuous function f given by 


xloga@+1) ifx > 0, 


F@) = eae ifx <0, 


satisfying (6.6a) for all x, y € R with 
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It follows that the set 
PW= AO, | 
|x| 

is unbounded for any linear mapping A : R — R defined by A(x) = cx fora given 
c € R. In other words, an analogue of Result 6.3 cannot be obtained. 

The following general result of the type above is given by Th.M. Rassias and 
P. Semrl [686]. 
Result 6.5. Let 6 : Ry x Ry — Rx be a positive homogeneous function of degree 
p #1. Let X be a normed space and Y be a Banach space, and further assume that 
the function f : X — Y satisfies 


If@+y)— FQ) — FOI S Ala yD 


for allx, y € X. Then there exists a unique additive mapping A: X — Y satisfying 
Il f@&) — A@)I| < All|]? 


for all x in X, where 


7 {ie for p <1, 
on forp> 1. 

The following theorem of J. R&tz [691] shows that a far-reaching generalization 
of the range is still possible. We remark that a nonempty set S with a binary operation 
* is called a power-associative groupoid if x"*" = x™ * x" holds for any natural 
numbers n and m and any element x in S. 


Result 6.6. Let S be a power-associative groupoid and let Y be a sequentially com- 
plete topological vector space over the rationals. Further, let V be a nonempty 
bounded convex subset of Y containing the origin, and let f : S — Y be a function 
satisfying 

flee] = fat *y") 
and 

fa@*y)—f@)—FfO)EV 
forall x,y € S, for some integer k = 2, and for all n in Z*.. Then there exists an 
additive function A: S — Y with f(x) — A(x) € V forall x,y € S. If Y isa 
Hausdorff space, then A is unique. (Here V denotes the sequential closure of V.) 


As J. Ratz [691] pointed out, power associativity does not imply commutativity, 
even if S is a semigroup. 

There are several possible ways to generalize Theorem 6.1. A natural way is to 
generalize the domain X. The following generalization is a consequence of a more 
general result of J. R&tz [691]. 


Result 6.6a. Let S be a semigroup with the property that for each x, y in S there 
exists an integer n > 2 with 

(x y)" = x". oe, 
Further, let Y be a Banach space. Then Cauchy’s functional equation is stable for 
the pair (S,Y). 
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6.2 Stability—Multiplicative Equations 


Theorem 6.7. (Baker [92]). Let 6 > 0, let S be a semigroup, and let f : S + C be 


such that 
(6.7) Ify)— f@)FO)| <6 forallx,yeS. 
Then either 
1+/1+46 
If(x)| < ne 


forallx € S or f is multiplicative for all x,y € S. 


Proof. Notice that e7 — ¢ = 6 and ¢ > 1. Suppose there exists a € S such that 
| f(a)| > e, say | f(a)| = ¢ + p for some p > 0. Then 


If (a) = If @ - F@? — f@))| 
>|f@1-|If@? - f@)| 
>If@P-6 
=(e+p)’-6 
a @+p)+ Ce=—latp- >e+2p. 

By induction, | f(a?’)| > ¢ + (n+ 1)p foralln = 1,2,.... 
For every x,y,z € S, 


If@yz)- f@yFf@l <6 and |f(yz)— fa) fOz)| < 4, 


so that 


fy) f(z) — FOS Oz) = |fryz) — fa) fz) + fay) f(@ — f(ryz)| < 26 


and 
If ay) f(2) — fOFOFI < Fey -— FA)FOz)I 
+ If Q)F Oz) — fOCDFO)F()| 
< 264+ |f(x)|d 
or 


If xy) — FO)FO)-IF@)| S26 + |F@)Id. 
In particular, 
26 + | f (x)|6 
Ifa") 
for all x,y € S and alln = 1,2,.... Letting n — +00 shows that f(xy) = 
f(x) fQ) forall x, y € S; thatis, f is multiplicative. 


If@ey)-— F@)FOI Ss 
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Remark. The crucial step in this proof is the multiplicative structure of complex 
numbers. Because the quaternions and Cayley numbers enjoy the same property, 
it is easy to see that Theorem 6.7 is also true for functions f with values in the 
quaternions or Cayley numbers. For real normed algebras for which the norm is not 
multiplicative, the situation is not so simple, as the following example shows. Let 
5 > 0, and choose ¢ > 0 so that |e — ¢?| = 6. Then e # 1, and let 


e~ 0 
f@ms= i | for x real. 
Then, with the usual matrix norm, 


Ifa +y)— FOFOII = 4 


for all real x and y, and f is unbounded, but it is not true that f(x + y) = f(x) fQ) 
for all real x and y. 


Remark. Theorem 6.7 is true when the codomain is a normed algebra with the 
multiplicative norm instead of C. 


Result 6.8. (Shulman [738]). A measurable complex-valued function f ona locally 
compact group G satisfies the condition 


sup ess sup| f(xy) — f(x) f(y)| < 00 
xEG yEeG 


if and only if f is either essentially bounded or equal to a continuous multiplicative 
function almost everywhere. 


6.3 Stability—Logarithmic Function 


We note that a semigroup S is called left (right) amenable if there exists a left (right) 
invariant mean on the Banach space of all bounded complex-valued functions defined 
on S. A left (right) invariant mean is a bounded left (right) translation-invariant linear 
functional taking the value | on the constant | function. 


Result 6.9. (Székelyhidi [793, 789]). Let S be a left amenable semigroup and let 
M be a left invariant mean on the space of all bounded complex-valued functions 
defined on S. Let f : S — C be a mapping satisfying 


If@y)— f@)— FO) Se 
for all x, y in S. Then the function L : S — C defined by 
L(x) = Myl fry) — FO) 
for any x in S is the unique logarithmic function for which 
IIf@) -L@)Il se 
holds for any x in S. 
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From this result, we have immediately that Cauchy’s functional equation is stable 
for the pair (S, C), where S is a finite semigroup, or a commutative semigroup. 
The following result explains the role of the Hyers-Ulam sequence (see [285]). 


Result 6.9a. (Forti [285]). Let S be a semigroup and Y a Banach space. Further, let 
f:S— Y beamapping satisfying 


(6.8) If@y)-— f@)—-fOll se 
for all x, y in S. Then the limit 
Qn 
a= Bae re 


exists for all x in S and L : S — Y is the unique mapping satisfying 


If@) -—L@)Il sé 


and 
L(x’) = 2L(x) 


for all x in S. If S is commutative, then L is logarithmic. 


Result 6.9b. (Ger [309]). Let S be a left amenable semigroup, Y a reflexive Banach 
space, and p : S — [0, oo[ an arbitrary function. If the function f : S — Y satisfies 


If@y) — f@)— FOI Ss 00) 


for all x, y in S, then there exists an additive function A : S —> Y such that 


II f@) — A@)Il < p@&) 
holds for all x in S. 


Result 6.10. [738]. A measurable complex-valued function f on an amenable 
locally compact group G satisfies the condition 


sup ess sup| f(xy) — f(x) — f(y)| < 0 


xeEG yEeG 


if and only if f = ®©+L, where L is logarithmic for all x, y in G and ® is essentially 
bounded. 


The problem of stability of algebra homomorphisms also leads to the study of 
systems of functional equations. A result in this direction is given in [132]. 


Result 6.11. (Bourgin [132]). Let X,Y be normed algebras, where Y is complete. 
If f : X — Y is surjective and the functions (x, y) +> f(x + y)— f(x) — fQ) and 
(x,y) f(xy) — f()fQ) are bounded, then f is an algebra isomorphism of X 
onto Y. 
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The stability of the equation of multiplicative derivation of this functional equa- 
tion has been studied in [731]. Let X be a Banach space and 5(X) denote the algebra 
of all bounded linear operators on X. 


Result 6.12. (Semrl [731]). Let X be a Banach space and let A be a standard 
operator algebra on X. Assume that y : Ry — Rx is a function such that 


t+>+oo ¢ 
If f : A B(X) is a mapping satisfying 
If@y) — xf) — F@)yll < wall yl) 
for all x, y in A, then f satisfies 
f(xy) = xf) + yf @) 


0. 


forallx, yin A. 


6.4 Stability—Trigonometric Functions 


First we start off with the d’Alembert equation (C). 

Theorem 6.13. (Baker [92]). Let 6 > 0, let G be an Abelian group, and let f be a 
complex-valued function defined on G such that 

(6.9)  [f@ty) + f@-y)—2f@) FO) <6 forallx,y €G. 

Then either 


Fic ee a 


or there exists a complex-valued function E on G such that 


{E(x) + E*(x)} 
2 


forallx €G 


f@)= forallx EG 


and 


[E(x + y)— E(xX)EQ)| < forallx,y €G. 


NIG 


Proof. We will use a few times the inequality 
Iz — w| >= |Iz|—|wl| forz,w eC. 
Setting x = 0 = y in (6.9), we have 
2) —2F)"| <4 


or 


21 f0)*| — 21f O)| < 6; 
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that is, 


1+ /1+26 
(6.10) Oe (say). 


Note that 2(e? — ¢) = d and 2¢ > 1. 
Put y = x in (6.9) to get 


[f (2x) + FO) — f(x)*| <6. 
Using this and (6.10), we have 
[f(2x)| = |2f (x)? — FO) — {2f@)? — FO) — f2x)} 
> |2f (x)? — fO)| -— |f@x) — FO) — 2f (x) 
> |2f(x)? — FO 1-4 


> |2f@)*1-— | FO -46 
> ANf(«~)P—e-6. 


If there isa b € G with | f(b)| > e, then 
|f(2"b)| > co asn— ow. 
Let | f(b)| = ¢ + p for some p > 0. Then 
2|f)I? — [fF O) - (¢ +6) = 2 + py? — (€ +p) — (€ +4) 

= 2(e? —¢) + (4e — lp + 2p? —6 
= (4e — 1)p + 2p? > 3p. 

Hence 

2|f I? — (e +6) > [f+ 3p > e+2’p. 


So, 
| f(2b)| = 2 f @)P — (e+) > e +2’p. 


By induction on n, 
[f(2"b)| > e+2"*"p for alln. 


Hence f is unbounded and | f(2”b)| > co asn > oo. Withx = b, y= 0in (6.9), 
we have 2| f (b)| |1 — f(O)| < 6. Since f is unbounded, it follows that f (0) = 1. 
For any x,y € G, 


21f@)1- If) — fy) = 2f@) FO) —2f@)FCy)I 
= [2fQ)fQ) — f@+y)— f&— y)I 
+If G+ y+ I@—y) —2fQ)F Cy) 
< 206 
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(as we see by replacing y by —y in (6.9)), and since f is unbounded, it follows that 
f (—y) = f() for all y € G; that is, f is even. 
Letting y = x, (6.9) yields 


[fQx)+1—- f(x] <6. 


If f(2x) = 1 forall x € G, then |2 — f(x)?| <6 forall x or|f(x)?| < 2+ for all 
x € G, contradicting the unboundedness of f. So, there is ana € G with f(2a) 4 1 
and a € C such that 2a*(f (2a) — 1) = 1. 

Define 


g(x) = f(x +a) — f(x —a) 
and 


(6.11) E(x) = f(x) +ag(x) forallx €G. 


Since f is even, 


g(—x) = f(—x +a) — f(-x —a) 
= f(x —a)— f@ +a) 
= —g(x). 
That is, g is odd. Further, 
E(—x) = f(—x) + ag(—x) 
(6.1 1a) = f(x) —ag(x). 
Adding (6.11) and (6.1 1a), we get 


fey = EOC) 
as asserted in the theorem. 
Let 
(6.12) 2f@)FO)=faty+fa-—y)+tG@,y), 
so that 
It(x, y)| <6 


for all x, y € G. Next we compute (see Chapter 3) 


2f(x)g(y) + 2fQ)g) = 2f@)fO +4) —2f@)fFO — a) 
+2fO)f@ +4) -2fO)F@ — a) 
=f+y +e) + fe—y—@) +1, y+ 4) 
= (eri =a=fe=y haya y =8) 
+fa+yt+a)+fO-—x-a)+i@+a,y) 
= (ya —a)— {Ys +a)—1K —a,y) 
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=2f(x+y+a)—2f(xt+y—a)+ f—y-a) 
—fyy-x+a)+fQ—x-a)— fyy—x+a) 
+t, y +a) —t(x, y—a)+t(x +a, y)—t(x —a,y) 

=2g(a+y)+t,y+a)—t(x,y—-a) 
+t +a, y)—t( —a,y) 


since f is even. Hence, we have 


2g(x + y) — 2f (x)g(y) —2f (a(x) 
=t(x,y—a)—t(x,y+a)+t(x—-—a,y)—-t(e+a,y). 


Since |t(x, y)| < 6, we have from the equation above 


(6.13) lex +y)— f(x)g(y) — fO)g(x)| < 26 


for allx, y € G. 
Similarly, using (6.12), we obtain 


2g(x)g(y) = 21f@ +4) — fx -—DIIfO +a) -fO—-4)] 

=2f(x +a)f(yt+a)—2f@+a)f(y—a)—2fa—-—a)ft+a) 
+2f (x —a) f(y —a) 

= f(x+y+2a)— f(x—y)+t@+a,y+a)— f+y) 
— ff —y+2a)+t@+a,y—a)— f@+y)— f(x —y —2a) 
—t@-a,yt+ta)+f@+y—2a)+ f@—y) +t —a,y—a) 

=2f(—y)-2f@+y)t+f@tyt+2a)+ faty—2a) 
— f(x —y+2a) — Ff —y—2a)+t@+a,y+a) 
—t(x+a,y—a)+t(x -—a,y —a)—t(* —a,y+a) 

=2f@—y)-2faty)+2faty)f a) 
—taw+y+2a,x+y—2a)—2f(« — y)f (Qa) 
+t(x—y+2a,y—x —2a)+t(x +a,y+a) 
—t(x+a,y—a)+t(x —a,y—a)—t(—-a,y+a). 


Hence 


2g (x)g(y) — 2(f 2a) — DIF @ +y¥) -— fF — y)] 
=tx —y+2a,x —y—2a)—t(xe+y+2a,x+y —2a) 
+t(x+a,y+a)—t(x+a,y—a) 
+t(x-—a,y—a)—t@—a,y+a). 


Since |t(x, y)| < 6, we get 


(6.14) Is@@)g(y) — Fa) -— DFO + y) — F@ — y))| S 30. 
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Further, we get 


fa+y+f@-y)|— 


(6.15) F@)FO) - 5 = 


O 
5 
Now, using (6.13), (6.14), and (6.15), we compute 


JE(xe + y)— E@)EQ) =If@ +y) tag@+y) — f@) FO) — a78(x)g0) 
— af (x~)g(y) — af Y)g(x)| 
< |a|-le@+y)— f@)gQ) — FO)g@)| 
+|f@+y)— f@FO)- a7 s@)geO)| 


< 3)alo+|f@ +y) — a7 g(x)g(y) 


_f@tyt+fa—y) 
2 


< s)a6+2 4 [FE +9=FE—y 


9 
2 


-—a*(f(@)- DIf@t+y)-f@- » + 2\a|6 


1 
= 3 (21a ae lgiP 7) 


for all x, y € R. Since f is unbounded, a can be chosen so that 


|f(a)| and |f(2a)| 


can be as large as desired. Thus @ can be chosen so that |a| is as small as we wish. 
Thus we must have 


x 9) —m@)mGy)| ° 


for all x, y € R. This completes the proof of the theorem. 


From Theorems 6.7 and 6.13, we easily obtain the following. 


Result 6.13a. Let 6 > 0, let G be an Abelian group, and let f be a complex-valued 
function on G such that 


(6.9) If@+y)+ fa —y)-2f@)fO)| <6 forallx,y eG. 
Then either 
|f(x)| < ee forallx €G 


or 


(C) faty)+ f@—y)=2f@)fO) forallx,y €G. 
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If we let 


1+4/T4 20 
f= for all x € G, 


then (6.9) holds but (C) fails. Thus the estimate in Theorem 6.13a is optimal. Also 
notice that if f is sufficiently uniformly small or sufficiently uniformly close to 1, 
then (6.9) holds. 


Using well-known results concerning the cosine equation (see, e.g., [424]), we 
obtain the following corollary to Theorem 6. 13a. 


Corollary. Let 6 > 0 and let f be a complex-valued, Lebesgue measurable function 
defined on R” such that 


If@+y)+ f@—-y)-2f@)FO) <4 forallx,y eR". 


Then either 


+ /1 +206 


1 
If()| < 5 forall x € R" 


or 


f(x) —cos(c-x) forallx €R’", 


where c is an n-tuple of complex numbers and 


n 
cx = > CiXj 
i=1 


ifc = (cj,...,Cn) and x = (X1,...,Xn). 


Badora [82] gave a new, shorter proof of Baker’s result, Theorem 6.13, which we 
will adopt in Theorems 6.14 and 6.15 investigating the stability of (6.16) and (6.17). 
The following result is proved in [82], and an example shows that the theorem is not 
valid in the case of vector-valued functions. 


Result 6.13b. (Badora [82]). Let ¢,6 > 0 and f : G > C (Ga group) satisfy the 
inequality (6.9) and 


Ifa@t+ty+z2)-f@e+z+y)| <0 forx,y,7zEeG 


(similar to the (K) condition). Then f is either bounded or satisfies d’Alembert’s 
equation (C). 


Counterexample. [82]. Let G be a group and f : G > M2(C) be defined by 


fa) = c. ie zee. 


¢ 
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where g : G — C is an unbounded function fulfilling equation (C) and c (€ 1) € 
R*. Then 


If@+y+ f@—y)—2f@)FO)Il = constant > 0 forx,y €G. 
This difference is bounded, but f is neither bounded nor satisfies (C). 


We now investigate the stability problem of the functional equations 


and 
(6.17) fat+y)+ f@—y)=2f@)g), x yEG, 


which are generalizations of (C) (refer to (3.27)). 


Stability of (6.16) and (6.17) for Complex Functions 


In this section, we will consider the stability of the equations above. First we will 
take up (6.16) and prove the following theorem. 


Theorem 6.14. (Kannappan and Kim [484]). Let ¢ > 0 and f, g : G > C satisfy 
the inequality 


(6.16a) If(xty)+ fe —y)—2g)fO)| <¢, 


with f satisfying the (K) condition, where G(+) is a group. Then either f and g 
are bounded or g satisfies (C) and f and g satisfy (6.16) and (6.17). Further, in the 
latter case there exists a (homomorphism) exponential E : G — C* satisfying (E) 
such that 


b 1 
(6.18) FG) = (EG) + E(-x)) and g(x) = 5(E@) + E(-x)) 


for x € G, where b is a constant. 


Proof. The proof is modelled after [82]. We will consider only the nontrivial f (that 


is, f #0). 
Put y = 0 in (6.16a) to get 
If@)-—8@fOl<5 forx eG. 


If g is bounded, then using the above we have 


IF@) = If) -— gO FO + g@) FO 
< = + Ig(x) FO), 
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which shows that f is also bounded. On the other hand, if f is bounded, choose yo 
such that f (yo) 4 0 and then use (6.16a) 


f(x + yo) + f& — yo) FS OS IE SI oy a. 
2f (yo) 2f (yo) ~ 21 f (yo) 


to get that g is also bounded on G. It follows easily now that if f (or g) is unbounded, 
then so is g (or f). 

Let f and g be unbounded. Then there are sequences {x,} and {y,} in G such 
that g(x,) #0, |g(xn)| > coasn > coand f(y,) £0, lim | fOn)| = oc. 

First we will show that g indeed satisfies (C). From (6.16a) with y = yy, we 
obtain 


Ig@)I— 


= 


FOE + I+ IE AI) _ oy 2 é . 
2f On) ~ 21 f On)| 
that is, 
(6.19) fie a) 


ut 2f (Yn) 
Using (6.16a) again and (K), we have 
|f(x + (y + yn) + f@& — + yn)) 


—2g@)f + yn) + fe + — yn)) 
+ f(x — (vy — yn)) — 28) f(y — yn)| S 2¢, 


so that 
f(&+y) + yn) + f(&% + y) — yn) 4. f(@ — y)+ yn) + f(& — y) — yn) 
2f On) 2 f (Yn) 
fy + yn) + f — yn) 
2 ee fe 
a 2FOn) “Fool oe 


which with the use of (6.19) implies that 


lex +y)+ g(x — y) — 2g(x)g(y)| < 0; 


that is, g is a solution of (C). 
As before, applying (6.16a) twice and the (K) condition, first we have 


(6.19a) fin fQ@n+y)+ fn —y) = fO) 
u 28 (Xn) 


and then 


[f(G@n +x) +y) + f(Gn + x) — y) — 28 +x) f() 
+ f(Qn —x)+y) + f(On x)—y) 28(Xn x)f(y)| < 2e, 
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so that 
fn + (& +y)) + fG@n — & + y)) iat @=y) Fi Ge— CO =—y) 
SS SS eS 
22 (Xn) 28 (Xn) 
B(Xn +X) + B(%n — x) é 
260) 1 Teer 


From (6.19a) and g satisfying (C), it follows that 


f(x +y)+ fx — y) — 2g) f(y)| < 9; 


that is, f and g are solutions (6.16). 
Choose yo such that f (yo) 4 0. Then (6.16) gives 


f(x + yo) + f@ — yo) 
2f (vo) 


so that g also satisfies the condition (K). Since g satisfies (K), from [424] and 
Theorem 3.21, we see that there exists a homomorphism (exponential) E : G > C* 
satisfying the second part of (6.18). 

Finally, applying (6.16), (6.19), and (K), we get 


g(x) = 


|f (Qn ty) +x) + f(On + y) — x) — 2gGn + y) f(x) 
+ f(@n —y) +x) + f(G@n -— y) — x) —2¢Gn — y)f@)| < 2e 


and that 
F@at+@+y))+ f@n-—@+y)) ai fQ@n+@—y))+ fn -@—y)) 
28 (Xn) 2g (Xn) 
8(%n+y)+8Gn—y) é 
29 popes SA Se SEE a 
Fe 28 (@n) ~ TeG@nl’ 


resulting in (6.17). From (6.16) and (6.17), it is easy to see that f(x) = bg(x) for 
some constant b. This proves the theorem. 


Now we take up the stability of (6.17). We prove the following theorems. 
Theorem 6.15. Let ¢ > 0 and G be a group. Suppose f,g : G — C satisfy the 
inequality 
(6.17a) If@+y)+ f@—y)—-2f@)sO)l Se forx,yeG, 


with f even (that is, f(—x) = f(x)), and f satisfies (K). Then f and g are either 
bounded or unbounded, and g satisfies (C) and f and g are solutions of (6.17) and 
(6.16). 


Proof. We consider only nontrivial f; that is, f 4 0. When f is bounded, choose 
Xo such that f (xo) 4 0 and use (6.17a) to get 
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ini Ifo +y) + fo = yl [Got WT FMO=Y) _ ey 
2|f (o)| ~ 2 f (xo) 


é 
2 
~ 2) Ff Go|’ 


which shows that g is also bounded. 
Suppose f is unbounded. Choose x = 0 in (6.17a) to have 


If) + f(-y) — 2fOsO) S & 


that is, 


IFO) - FOODS 5 


(this is the only place where we use that f is even). Since f is unbounded, f(0) 4 0. 
Hence g is also unbounded. 
Let f and thus g be unbounded. Then there exist sequences {x,} and {y,} in G 


such that f (in) #0, |f(n)| > 00, gyn) £9, |g (yn)| > 00. 
Twice applying the inequality (6.17) and using (K) for f twice, first we get 


fn ie). < é : 
2 f (Xn) ~ 2|f Gn)| 
that is, 
. fGnty+fOn-y) _ 


Then we obtain 


If (Gn +x) +y) + f(Gn +x) — y)—2f Gn + x)8(y) + f(Gn—x) + y) 
+ f(@n — x) — y) —2f On — x)g(y)| S 2e: 


that is, 
fG@n+@+y)+ fGn-@+y)) 4 f@n + @ —y)) + fG@n- &@ —y)) 
2G) 2f (Xn) 
fQ@n+x) + fGn — x) g 
=9 EN ee a 
oe 2F Cn) = TFG 
which by (6.20) leads to 


lex +y) +g —y)— 2g(y)g(x)| < 0, 


so that g satisfies (C). 
Again applying the inequality (6.17a) twice and using the (K) condition for f 
twice, first we have 
f@+yn) + f(& — yn) é 


6.21 i a ee ee eee , 
a 280m) I.) = ooo 
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and then we get 


f(x + Ont+y)) + f£@ — On+y))-2f@)gOn + y) + f+ On - y)) 
+ f@— On —y)) —2f(®)8 On — y)| S 28; 


that is, 
f(&%+y) + yn) + f(&% + y) — Yn) 4. f(a — y) + yn) + f(@ — y) — yn) 
2g(yn) 28 (yn) 
80nt+y)+8On—-y) é 
a) oh eee 
f@) 2g(yn) lgQn)I : 


which by (6.21) and (C) yields 


If@+y) + f£@ —y)—2f(@)gQ)| < 9, 


so that f and g are solutions of (6.17). 
Consider the inequality 


If (On +x) +y) + f(On +) — vy) —2¢0n +x) £0) + (On — x) + y) 
+ fF (Qn — 4) — y) — 2¢0n —*) FQ) S 2¢. 


As before, using (K), (6.20), the evenness of f and (C), and the division by 2g (yn) 
yields 


f(a +y)+ yn) + f(@ + y) — yn) ae f(x —y)+ yn) + f(x — y) — yn) 
28 (yn) 28 (Yn) 
g(yn +x)+ 8(Vn —x) 


Seay, asn —> OOo, 


so that f and g are solutions of (6.16). This completes the proof of the theorem. 


Note that the evenness of f is used to prove that g is unbounded when /f is and 
nowhere else. 


6.4.1 Stability for Vector-Valued Functions 


Badora [82] gave a counterexample to illustrate the failure of the superstability of 
the cosine functional equation (C) in the case of the vector-valued mapping (see the 
counterexample above). Here consider the following example. Let f and g be the 
unbounded solution of (6.16) (or (6.17)), where f, g : G > C. Define f{, g1 : G > 


M2(C) by 
see") je w=). 


for x € G, where c} #0, c2 4 1. Then 


Ifi@ +y) + fie — y) —2fi(y)gi()|] = constant > 0 
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(or || fi@ + y) + fi — y) — 2fi@)giGy)|| = constant > 0) for x, y € G. This 
fi and g; are neither bounded nor satisfy (C). Therefore there is a need to consider 
the vector-valued functions separately. We prove the following two theorems in this 
section. Let G be a group and A be a complex normed algebra with identity. 


Theorem 6.16. [484]. Suppose f, g : G > A satisfy the inequality 


(6.16b) IF@+y) + FO —y)—2@)FOI se 
forx,y €G, with f satisfying (K) and 
(6.22) If@)-fCxyIl <n, forx eG, 


for some e,n > 0. Suppose there is a z € G such that g(zo)~! exists and 
\| f («)g (zo) || is bounded for x € G. Then there is an E : G — A such that 


(6.23) |Ea+y)-E@)EQ)|| <a, forx,yeG, 
and 
(6.23a) [#09 - 5(E@) + E(—x))| <a, forx €G, 


for some constants a, and a2. 


Proof. Let m := sup,eg || f(x) g(Zo)||. Then, using (6.16b) and (6.22), we get by 
using (K) 


foie ol iF Hecate Ceca 
<M + Sf 0-2) + eo +8) ~2e(e0) F(X) 


— (f(-zo +x) + f(—z0 — x) — 2g(—z0) f )) 
—(fG@ot+x)— f(-zo-x) + fo — x) — f(-zo + x))II 
<m+eée+y. 


Define a function h : G > A by the formula 
h(x) = 5(F) + f(—x)) forx €G. 
Then / is even (that is, h(—x) = h(x)) and 
(6.24) |n(x) — F@)I S 2 forx eG,  ||h(x)g@o)|| <m. 


Define a function E : G > Aby 


E(x) =h(x)+ig(zo) forx €G. 
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Utilizing (6.24), we get (first using commutativity in A) 
JE(x + y)- E@)EGV)Ih 
= ||A@e + y) Hig(zo) —h&)hAG”) Fi(A@) +h(y))g(z0) + 8(Zo)* ll 
<|A@ + y)Il + AG)AG)I + IA) + A(~))8(Zo)Il + Ig (Zo) ll + Ig Zo)II* 
< lla +y)— fe + y+ UF @ +y)I + IAG)AC) 80)? + g(o)7 I 
+ lh(x)g(Zo)ll + Qe (Zo)I + Ile @o)ll + le (Zo)II 
+ ml||g(zo)||~ | +m lle (zo)ll-? + 2m + [1g (zo)ll + lg o)II? 


= 


(say), which is (6.23). Finally, by (6.24), we have 
1 
[se = 5 E@) a B-x)| 


1 
= |r — h(x) + h(x) — 5) + h(—x)) — tg Zo) 


IA 


1 
4 + llg(zo)ll = a2 


(say), which is (6.23a). This proves the theorem. 


Lastly, we prove the following result. 
Result 6.16a. Let f, g : G > A satisfy the inequality 


(6.17b) If@+y+f@—-—y)-2f@sOl se xy EG, 
with f satisfying (K) and 


If@)-fCx)llsn, forx eG, 


for some nonnegative ¢ and yn. Suppose there exists azo € G such that g(zo)~! exists 
and || f (x) g(zo)|| is bounded over G. Then there exists a mapping E : G + A such 
that 

|Ea+y)-E@EQ)Il <a, forx,y eG, 


and 


fe)- 5B) 2b-a)| Su, HSS. 


for some constants a, and a2. 


The proof runs parallel to that of Theorem 6.16. The following result, a slight 
variation of Theorem 6.13, also holds. Let ¢ > 0 and f, : G > C be a sequence of 
functions converging uniformly to f on G. Suppose f, g, fn : G > C are such that 


(6.16b) If@+y)+fa-y)—2g@) false, xy €G, 
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holds with f satisfying (K). Then either f is bounded or g satisfies (C) and f and g 
satisfy (6.16) and (6.17). 
The following extension of Theorem 6. 13a is true (Székelyhidi [788]). 


Result 6.17. Let G be a group, F a field, and V a vector space of functions from 
G into F, invariant by right translation. Let f, g : G — F be functions for which 
(K) holds for all x, y,z in G and the function x +> f (xy) + f (xy7!) —2f (x)g(y) 
belongs to V for all y in G. Then either f belongs to V or g satisfies (C) for all x, y 
inG. 

If here we take V as the space of bounded functions and f = g, then we get 
Theorem 6.13a. We remark that if S is a semigroup, F is a field, and V is a linear 
space of F-valued functions on S, then we say that the functions f, g: S — F are 
linearly independent modulo V if Af+yg € V implies 4 = w = Oforanyd, uw € F. 
We say that V is two-sided invariant if the functions x +> f(xy) andx h f(yz) 
belong to V for any f in V. 


Result 6.17a. Let S be a semigroup and V a two-sided invariant linear space of 
F-valued functions on S, and suppose that f, g : S — F are linearly independent 
modulo V. If the function x +> f(xy) — f@)g(y) — f)g(x) belongs to V for all 
y in S, then 

Fay) = fO)8) + FO)8@) 


holds for all x, y in S. 
Result 6.17b. Let S be a semigroup and V a two-sided invariant linear space of 


F-valued functions on S, and suppose that f,g : S — F are linearly independent 


modulo V. If the functions x +> f(xy) - f@)f)+g@)gQy) andx Pe f(xy) - 
f (yx) belong to V forall y in S, then 


Fay) = FO)FO) — 8@)8Q) 
holds for all x, y in S. 


6.5 Stability of the Equation f (x + y) + g(x — y) = h(x) k(y) 


(6.18) fa+y)+g@—y)=h(@)k(y). 


Result 6.18. (Székelyhidi [791]). Let f, g,h,k : G > C (Ga group) be functions 
for which the function (x, y) > f(x +y)+ g(@ — y) — h(x)k(y) is bounded.Then 
there are an exponential E : G — C, an additive function A : G > C, bounded 
functions a,b,c: G — C, and constants a, B, y, 6 such that we have the following 
possibilities: 


G) f,g,h,k are bounded; 
Gi) f, g are bounded, h = 0, k is arbitrary; 
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(ii) f, g are bounded, h is arbitrary, k = 0; 
(iv) f is bounded, g =aBE+b, h=aE, k= BE7'; 
(v) f =aPE +a, gis bounded,h =aE, k= BE; 
(vi) f= zaAt+a, g=—-jaA+b,h=a,k=Atc; 
(vil) f =4BA+a, g=SfBA+b, h=Atc, k=f; 
(viii) f = faBA? + S(ad+ By)A+a, g =—jaBpA? + 45(ad—- fy)A+b, 
h=aA+y,k=fA+6; 
Gx) f= 5(ay + Po)Ee+ 3(ad + By )Eo ta, h=ak.+ fEo, 
g =4(ay — BO)Ee — 4(ad— By Eo +b, k= y Ee + 5Eo; 


where E. and Ej are even and odd parts of E, respectively. 


6.6 Stability of the Sine Functional Equation 


Result 6.19. (Cholewa [157]). Let 6 > 0, let (G,+) be an Abelian group that is 
2-divisible, and let a function f : G — C satisfy the inequality 


(6.19) Ifa +yf@—y)— f+ fOY1 <4 


unbounded. Then f satisfies the equation 


(6.17) fa +ty)+ f@—y)=2f@)s0) 
for allx, y € G, where 


f(@ +a) — fe —a) 
2 f(a) 


forall x € Gand for some a € G with | f (a)| = 4, and further f is a solution of the 
sine functional equation (S). 


g(x) = 


6.7 Stability—Alternative Cauchy Equation 


We are now going to deal with the stability of the alternative Cauchy equation 


(6.25) If@+yl=If@)+fO)|  forx,y €G, 
(6.26) If@+ yl =IFG)+ FO) forx,y EG, 


where G is a commutative semigroup. 
We say that a partially ordered vector space V is a Riesz space if sup{x, y} exists 
for all x, y € V. We define the absolute value of x by the formula 


|x| = sup(x, —x) > 0. 
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V is called Archimedean if x < 0 holds whenever the set {nx | n € Z%} is 
bounded from above. The lexicographically ordered plane is an example of a non- 
Archimedean Riesz space. 


Definition. The sequence {/,,} in V is said to converge e-uniformly to the element 
ft € V whenever, for every ¢ > 0, there exists a natural number no such that | f — 
Jn| < ee holds for alln > no. The sequence { f;,,} in V is called an e-uniform Cauchy 
sequence whenever, for every ¢ > O, there exists a natural number n, such that 
| fin — fn| < ee holds for all m,n > 74. 


Let us point out that if V is Archimedean, the e-uniform limit of a sequence, if it 
exists, is unique. 

We say that the set W C G is weakly bounded if and only if for arbitrary x € 
G\ {0} there exists ann € Zi such that 


kx €W fork >n. 


Clearly every bounded set in a normed space is weakly bounded. 


Result 6.20. [106]. Let G be a commutative semigroup and let W C G be a weakly 
bounded set. If a function f : G > R satisfies the inequality 


(6.26a) f@+yl-lf@)+fO)| <6 


for some 6 > 0 and every x, y € G\W, then there exists a unique additive function 
A:G-— R such that 


| f(x) — A(x)| < 36 forx EG. 


Result 6.20a. (Batko and Tabor [106]). Let G be a commutative semigroup and let 
W c G be a weakly bounded set. Let K be an Archimedean, e-uniformly complete 
Riesz space. If a mapping F : G —> K satisfies the inequality 


(6.25a) [IF @ + y)l-|F@x) + FO)I| <e 


for some e € K+ andevery x,y € G\W, then there exists a unique additive mapping 
A:G-— K such that 


|F(x) —A(@x)| <3e forx eG. 


Result 6.21. [106]. Ler $ be a real interval with 0 € S, ¢ € (0,4), and let 
f :S = R be a function satisfying 
fa +y)—f@)-fo)eZ+[-¢,¢] forx,yeS, 


withx + y € S.If f is Lebesgue or Baire measurable, then there exists ac € R such 
that 


f(x)-cx € Z+4+[-e,€] forx eS. 


6.8 Stability—Wave Equation 321 


Result 6.22. (Bean and Baker [107]). Let f : R > Rand f be Lebesgue measur- 
able. The Cauchy difference of f, f (x+y) — f(«)— f(y), takes integer values only 
if there exists an additive A : R — R such that f — A is Lebesgue measurable and 
takes integer values only. 


6.8 Stability—Wave Equation 
Suppose f : R* — Ris such that 
(6.27) f(eth,y)+ fe —h,y)— f@,y +h) — f(x,y —h) =0. 
If f : R? — Risacontinuous solution of (6.27), then 
f@,y) =a(x,y)+B(—y) forallx,y ER, 


where a, 8 € R — R are continuous (Brzdek and Tabor [138]). 

A simpler proof of this fact was given by Haruki [353]. In [132] it is shown 
that if a, 8 : R — R are arbitrary functions and B : R* — R is biadditive and 
skew-symmetric, and if f : R* > R is defined by 


f@,y)=a(x+y)+Bb@—y)+ Bo,y) forallx,y eR, 


then (6.27) holds. McKiernan [629] showed that this is indeed the general solution 
of (6.27) by transforming it into an equivalent equation as follows. For f : R* > R, 
define g : R* > R by g(x, y) = f(x+ y,x — y) forall x, y € R. Then f satisfies 
(6.27) if and only if g satisfies 


gath,y+h)—gath,y)—g@,yth)+eg,y)=0 forallx,y,heR. 


Result 6.23. [107]. Suppose G is an Abelian group, X is a Banach space, 6 > 0, 
and f :G x G — X such that 


(6.27a) If@th,y+h)—farth,y)—fasyth)t+ fo, yl so 


for allx,y,h € G. Then there exist functions a, B : G—> X andB:GxG— X 
such that B is biadditive and skew-symmetric and 


If @,¥) — lax) + B(y) + BG, yl < 200 forallx,y €G. 


Result 6.23a. [107]. Suppose f : R* — R is Lebesgue measurable on R*, 6 > 0, 
and 


If@th,yth)—farth,y)—fa,yth)t+f@,y)| 26 forallx,y,heR. 


Then there exist Lebesgue measurable functions 9, y : R + R such that 


If, y) — {gp(x) + wQ)}| < 606 


forallx,y ER. 
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6.9 Stability—Polynomial Equation 


Now we consider stability results for the polynomial equation and for Fréchet’s equa- 
tion. We will use the standard notation for the difference operators. If S is a semi- 
group, V is a vector space, and f : S — V is a function, then the difference A, f of 
f by y € V is defined by the formula 


Ay f(x) = f(xy) — f@) 


for all x in S. As S is not necessarily commutative, we should rather call A, f the 
right difference of f by y. The composite operator Ay, Ay,--- Ay, is denoted by 
— yn IE y1 = yo = +--+ = Yn, then we use the notation AY for Ay yy,..yn- 
A function p : S — V is called a polynomial of degree n — 1 if 


(6.27b) A® p(x) =0 


holds for all x, y in S. For instance, any solution p : S — V of the Fréchet- 
functional equation 


(6.27c) Ay 1, y25-0¥n D(X) = 0 


is obviously a polynomial of degree n — | (see Chapter 13). 

If S is a semigroup and V is a topological vector space, then, as above, we can 
say that equation (6.27b) is stable for the pair (S, V) if, for any function f : S > V 
for which the function (x,y) t+ A‘ f(x) is bounded, there exists a polynomial 
p:S — V of degree at most n — 1 for which f — p is bounded. Similarly, we say 
that Fréchet’s equation is stable for the pair (S, V) if, for any function f : S > V for 
which the function (x, y1, y2,.-.5 Yn) F® Ayy,yo,...,y, £ &) 18 bounded, there exists a 
solution p : S — V of Fréchet’s equation for which f — p is bounded. The main 
result of D.H. Hyers’s paper [384] is the following. 


Result. Jf X is a rational vector space and Y is a Banach space, then Fréchet’s 
functional equation is stable for the pair (X,Y). 


Actually Hyers requires the boundedness of the mapping 


(x, Yi; y2> er) Yn) b> Ay, y2,-0yn FX) 


only on a convex cone in X. 
A generalization of Hyers’s theorem is the following. 


Result. [f S is a commutative semigroup and Y is a Banach space, then Fréchet’s 
equation is stable for the pair (S, V). 


The method of proof of this theorem is similar to that of Hyers applied in [384]. 
Székelyhidi [788] gives a short proof of the theorem above in the case Y = C using 
invariant means. By the same technique, he proves the following theorem in [776]. 
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Result. If S is an amenable semigroup with identity, then Fréchet’s equation is stable 
for the pair (S,C). 


Together with equation (6.27b), one can consider the monomial equation 


(6.274) AN f@) =n! fy) 


for functions f : S > Y, where S is a semigroup and Y is a linear space. Solutions 
of (6.27d) are called monomials of degree n. Obviously, any monomial of degree n 
is a polynomial of degree at most n. 

Some of the stability results for different functional equations we have considered 
so far are special cases of the following general result for linear functional equations 
[776]. 


Result. Let G be an Abelian group and let B;,‘¥; : G — G be homomorphisms 
for which the range of ®; is contained in the range of ‘¥i (i = 1,2,...,n + 1). Let 
St. fi: G @ C be functions for which the function 


n+l 
Gy) f@)+ >. fi(®i@) + ¥i0)) 


i=] 
is bounded. Then there exists a polynomial p : G — C for which f — p is bounded. 


Among others, Cauchy’s functional equation, Jensen’s functional equation, 
Pexider’s functional equation, the quadratic equation, the polynomial equation, and 
the monomial equation belong to the scope of this theorem. 


6.10 Stability—Quadratic Equation 


The first result is due to Skof [750]. Thereafter many authors investigated the stabil- 
ity problem of the quadratic equation (Q) in various settings. 


Theorem 6.24. Let G be an Abelian group and let X be a Banach space and f : 
G — X. If the quadratic difference 


(x.y) > Jeb) SF 6 = 7) — 27 @) —27 0), jorx,y eG, 


is bounded, there exists a quadratic function g : G — X for which f —q is bounded; 
that is, for a fixed 6 > 0, if 


(6.28) If@+y)+ f@—y)—2f(@)-2fO)I So, forx,yEG, 


there exists a unique quadratic mapping q : G — X such that 


(6.28a) ll f@)—q@)|l < ° forallx €G. 
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Moreover, the function q is given by 


gn 
(6.29) g(x) = lim | f( 2 for all x, 
n—>0o qn 


and if G is a real normed space, f is measurable, or f (tx) is continuous in t for 
each fixed x € G, then q(tx) = t?q(x) forallx €G, t ER. 


Proof. The proof runs parallel to that of Theorem 6.1 (see [750, 186, 158]). 
Set x = 0 = y in (6.28) to have 


0 
0)|| < =. 
IFOl 5 
Letting x = y in (6.28), we obtain 


Il f2x) —4 Ff) — FOI < 4; 


that is, 
fx) —4f@)ll — IFO) < 6 
or 
ay O 
(6.30) I: — f (2x) — f(x) = 5° p< 1 forx EG 


Replace x by 2x in (6.30) to get 


[;7e9 - fen 


that is, 


22% 


[z2r@%) - ro) + 70) - Fre) or 


| 1 


or 


< ay. 5 (by (6.30)) 


2 
git 24) — fe) S oes 


By induction, 


(6.31) IZ fQ"x)=— f(x) 


qn 
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Now we prove that 


f(2"x) 
qn 
is a Cauchy sequence for each x € G. Choose m > n. Then 


—_—_— f 2" n -2"x) — f(2"x) 


ny m i 
are fe" =z 


4m—n 
1 o 1 
= al! geal CFOS) 


6(1 1 
~ 24 amy 


325 


Thus the sequence is Cauchy and, since X is a Banach space, converges to a limit 


function, say g : G — X, which proves (6.29). 
The next step is to show that g(x) is quadratic. Now 


Iqa@+y)+q@—y)- si — 2q(y)Il 


= lim A Fats + 2”y) 


n>oo Gl 
+ f(2"x — 2" y) —2f (2"x) — 2f 2"y)|| 
< lim & >0 asn->oo (by (6.29)). 
noo 4 
Hence q is quadratic. 
The next goal is to show that (6.28a) holds. 


2”"x) 
jas) Fey = | tim SE — poo] 
= im pes - Foo] 
an) 
= (by (6.31)) 
_o 
_2 


Hence (6.28a) holds. 


Finally, we prove the uniqueness of g. Suppose g : G — X is not unique. Then 


there exists another quadratic function t : G — X such that 


P) 
It) f@)l <5 forallx €G. 


Note that 


IIt(x) — gl < lle) — FDI + IF) — a @)Il 
6 Oo 
< c 


= o> 
=5 
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Therefore 
It(x)-—q(x)|| <6 forallx €G. 
Since a quadratic function is rationally homogeneous of degree 2, we have 


n’t(x) n?q(x) 
rn 


1 
Slie(nx) — g(nx)l 
6 


<= -0 asno-ow. 
He 


It) — @@) = 


n 


t(nx)  q(nx) 
a) 


n n 


Thus, we get 
t(x) =q(x) forallx eG. 


Therefore g is unique. 


Result 6.25. (Czerwik [186]). Let X be a normed space and let Y be a Banach 
space. Ifa function f : X — Y satisfies the inequality 


If@+y)+ f@ —y)-—2f@) -—2fO)Il < O(a? + IlyI?) 


for p > 2, some 0 > 0, and x, y € X, then there exists a unique quadratic function 
q such that 


If@) qos 


If p = 2, then the result is no longer valid. In [186], the following result was 
proved. 


|x ||P, x EX. 


Result 6.25a. [186]. Suppose the function f : IR — R is defined by 
CO 
f(x) = 924-79 2"»), 
i=Q 


where the function ¢ : R > R is given by 


J? — fix| 21, 
#00) = [i if lxl <1, 


with a constant 0 > 1. Then the function f satisfies the inequality 


If +y)+ fe — y) —2f (%) —2f()| < 3200? + y’) 


for all x, y € R. Moreover, there exists no quadratic function q : R — R such that 
the image set of x~*| f (x) — q(x)| (for x 4 0) is bounded. 
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It is true that a mapping g : X — Y (X areal normed space, Y a real Banach 
space) is quadratic if and only if 


lg + y) +q@ — y) — 2g(x) — 2¢(y)|| > 0 as |lx]] + |lyl] > 0. 


7 


Characterization of Polynomials 


In this chapter, polynomials are characterized, mostly of second and nth order. 
Divided difference, Rudin’s problem and generalization, Rudin’s problem on groups, 
Frechet’s result, and polynomials in several variables are treated. 


7.1 Polynomials 


Polynomials are essential tools for studying continuous functions. Several (results) 
characterizations of polynomials are known. In this chapter, starting with Lagrange’s 
mean value theorem, we show that polynomials can be (defined) characterized by 
using many tools and through functional equations. The study of polynomials 
on semigroups is based on the notion of multiadditive functions. We show that 
functional equations are very useful tools to study polynomials (see Aczél [31], 
Sh. Haruki [359], Sahoo and Riedel [722], Aczél and Kuczma [54], Kannappan 
and Crstici [483], Kannappan [462], and Cross and Kannappan [180]). With the 
help of many functional equations, we give several characterizations in general of 
polynomials of degree n and in particular quadratic polynomials or polynomials of 
degree 2. 

Lagrange’s mean value theorem is one of the important results in differential 
calculus. It is known that (refer to Aczél [31] and Sh. Haruki [359]) if f : R > Ris 
differentiable, then f satisfying the functional equation 


f@ +h) — f@) 


(7.1) 7 


= f'(x+6h), 0<6<1, 
holding for x € Randh € R%, has the solution 
(7.2) f(x) =ax*+bx+c forx eR, 


a polynomial of degree at most 2, if and only if 0 = 5. 


(It is easy to see thatif f has the form (7.2), then f is a solution of (7.1), provided 
d= 5: Conversely, suppose 0 = 5 in (7.1). Since f has a derivative, differentiating 
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the left side of (7.1) with respect to x, we see that from the right side of (7.1), f has 


a second derivative and so on. Thus f has derivatives of all orders. With 6 = 5; 


(7.1) becomes 


fxeth)= f(x) +hf’ («+ 50). 


Differentiate with respect to h and replace x by x — a to obtain 
i h h, ; 
Flstspash @t+ FG). 


Then set x = 0 to have f’(h) = 2ah + b. Thus f has the form (7.2). 

Soon we will see that differentiability can be eliminated to obtain the same con- 
clusion (7.2). 

It is easy to see that (7.1) with 6 = 5 can be rewritten as 


f@)-fY) _ 
x—-y 7 


(7.3) r() forx,y ¢R, x #y. 


Lagrange’s mean value theorem for differential calculus says that if f : R > R 
is differentiable and [x, y] is any interval, then there exists a mean value 7 € |x, y[ 
(that is, 7 is a function of x and y) such that 


(1.4) PORTO) Fons») 


Xx 


for x, y € R with x # y. The first statement of this result appeared in a paper by the 
physicist Ampére (1775-1836) [74]. Geometrically, it means that the chord joining 
(x, f (x)) and (y, f()) is parallel to the tangent line at (7, f(7)). 

One may ask for what f does the mean value n(x, y) in (7.4) depend on x and 
y in a given manner. From this point of view, (7.4) can be treated as a functional 
equation with unknown function f and given n(x, y). The function 7(x, y) can be 


any linear or nonlinear combination of x and y—for example, (x, y) = or 


J/xy or sx + ty, etc. With n(x, y) = a, (7.4) becomes (7.3). In connection with 
(7.3), Shigeru HarukiauthorciteH12 and Aczél [31] proved theorems (7.1) and (7.2). 
We present a modified proof from Kannappan and Laird [487]. Note that these results 
are obtained without assuming any regularity conditions on the functions involved. 


The most general form of the functional equation (7.4) is the functional equation 


f@)- 80) _ (= 
——— =¢ ee 
x-y 2 


(7.5) ) forx,yEeR, x Fy. 


Theorem 7.1. [487, 359]. Functions f, g,¢ : R — R satisfy equation (7.5) for all 
x,y € Rwith x # y if, and only if, there exist constants a, b,c such that 


(7.6) f@ms= ax’ +bx+c= g(x) and $(x)=2ax+b, x ER. 
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Proof. To prove Theorem 7.1, we obtain, on interchanging x and y in (7.5), f(x) — 
g(x) = g(y) — fQ), so that f = g. Putting y = 0 in (7.5) yields 


(7.7) f@)=xd (5) + f(0) forall x (valid for x = 0 also). 
Substitution of (7.7) into (7.5) gives 
(7.8) x(x) — yoy) = (@ — y)@(W + y) forall x, y, including x = y. 
First y = —x in (7.8) shows that 
(7.9) (x) + ¢(—x) =26(0) forall x. 
Now use (7.8) and (7.9) to get 
xP(x) = —yb(-y) + @ + y)O@ — y) = VO) + & — VIO +), 
so that 
(x + yIP@— y)-— GO) = @— yd@+y)—(0)] forall x, y; 


that is, 
u(p(v) — (0)) = oP) — (O)] for all u,v. 


So, we see that ¢ is linear (say, 6(x) = ax + b), and by (7.7), f(x) = sax? + 
bx + c forall x, so f is quadratic. This proves the theorem. 


Theorem 7.2. [359, 487, 31]. In R, the functional equations 
fO)= £0) _ 6@) +40) 


(7.10) forx,yER, x Fy, 
x-y 2 

and 

(7.11) Pont 9 (2), forx,yEeR, x#y, 
x-y Z 


are equivalent to each other, where f,¢: R > R. 
Proof. It is enough to prove that ¢ satisfies (J) since (7.10) with (J) gives (7.11) and 
(7.11) with (J) gives (7.10). 

Suppose that (7.10) holds. With y = 0, f(0) =c, ¢(0) =D, (7.10) gives 
(7.12) f@m= 5) +b) +e for allx € R. 


Consequently, equation (7.10) becomes 


(7.13) xh(y) — yh(x) = b(x — y) forallx,y ER. 
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Now, applying (7.13) twice, we have 


2xP(x +y) = (&+ y)P2x) +b(x — y) 


(7.14) 1 
= x6 (2x) + [2x¢(2y) + by — 2x)] + B& — y). 


This is x[(26(@ + y) — O(2x) — d(2y)] = 0. Thus ¢ satisfies (J) for x 4 0. Now, 
to show that ¢ (3) = “—* (ie., it results from (J) when x = 0) for all y, y = 0 
in (7.14) yields 2xo(x) = xf(2x) + bx; that is, 26(x) = d(2x) +b forallx € R. 
Thus (J) holds for all x, y, so that (7.11) is true. 


Remark 7.3. It follows from (7.13) that ¢ is linear (take y a nonzero constant and 
from (7.12) that f is quadratic and that 6 = f’). This result is obtained without 
assuming any regularity conditions on f and ¢. 


Finally, let us assume that (7.11) holds. Now (7.8) also holds. Making use of 
(7.8) twice, we get 
(x — y)@(x — y) = yo(y) + & — 2y)O(*) 
= [xh(x) + (vy — x)b@ + y+ & — 2y) h(x). 


Hence d(x — y) + d(x + y) = 24(x) for x # y, which indeed implies that (J) 
holds for all x, y in R. Hence (7.10) holds. This proves the theorem. 


Remark 7.4. Moreover, since ¢ satisfies (J), 6(x) = A(x) + b, where A satisfies 
(A), and (7.8) becomes x A(x) — yA(y) = @—y)AG+y) = (w—y)(A@) FAQ), 
producing xA(y) = yA(x). So, A(x) = ax for some constant a. Then ¢ is linear 
and f is quadratic. 


Now we present more characterizations of quadratic polynomials. 


7.2 More Characterizations of Polynomials of Degree Two 


Theorem 7.5. [359, 487]. Functions f, g : R > R satisfy the functional equation 


f@)~80) _ f'(x) +80) 


7TA5 
(7.15) _ : 


forx,yER, x Fy, 


if, and only if, f (x) = ax* + bx +. ¢ = g(x) for some constants a,b, and c. 


Proof. Suppose f and g satisfy (7.15). First, we present a short proof using deriva- 
tives. 
It is easy to see that f and g have derivatives of all orders. Rewrite (7.15) as 


fQx)-gi)= — (f'(x) + g’(y)) and differentiate with respect to x to have 


1 = 
(7.16) F'() = 5(F'@) + 8/0) + = f'@, 
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which, by differentiating with respect to y, yields 
f"(«) = g"(y) = constant = 2a (say). 


Then f(x) = ax? + bx +c; and g(y) = ay* + boy + co. Substitution of these f 
and g in (7.16) gives bz = by and in (7.15) gives cz = c}. 
Now we present a proof without resorting to derivatives. 
Second Proof. Letting y = 0 and x = 0 separately in (7.15), we get 
x 

f(x) = xf) + g'(0))+g@) forx 40, 

(7.17) y 
80) = 58'O) + FO) + FO) fory £0. 


Use (7.17) and (7.15) to get 
=(f'@) — £/) =50e'0) - 8'O) + SO -8O) 


forx,y #0, xy. 
From the above, first we obtain (fixing x and y separately) 


f'@) = aix + by, g'(y) = ary + bo, 
and then putting these back in the equation above, we have 
(7.18) f'@) =ax +b, 9x) =ax+by, 


where b; = f’(0), b2 = g’(0), with f(0) = g(0). Utilizing (7.18) and (7.17), we 
get 


f@) =) = =x +b, + box +.c(by — by). 


The converse is easy to verify. This completes the proof of this theorem. 


Now we consider a generalization of (7.15) and prove the following theorem. 


Theorem 7.6. [462]. Let f, g,h : R > R. They satisfy the functional equation 


x) — 
(7.19) POR LO) = 663) +h). forx,yER, x #y, 
if, and only if, f (x) = ax? + (cy +.c2)x +c, g(x) = ax+c1, h(x) = ax 4+ cp for 
some constants a,c, C1, and c2. 
Proof. Suppose f, g,/ satisfy (7.19). Set x = 0 and y = 0 separately in (7.19) to 
get 


fy) =yh(Qy)+eyt+e fory £0, 


(7.20) 
f@)=xg@)+ex+c forx £0, 
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where g(0) = c2, h(O) = c1. These values of f in (7.19) yield 
x(h(y)—c1) = y(ge)—@2), x #Y, Hy #0. 
Fix x and y separately to have 
h(y) =ary+c, g) =a2x+c2 forallx,yeR. 


Substituting these / and g in the equation above gives a; = a2 = a (say), and then 
(7.20) yields 
f@ms= ax” + (cy +.c2)x te. 


This proves the theorem. 


It is noted (Laird and Mills [596, p. 68]) that a nontrivial system 


m 


DIfe —taj)— f(x —tbj)])=0, forallx,t eR, 
j=l 


has solutions that are polynomials of degree m. It also follows that these polynomials 


m m m 
will be of degree 2 when }' (aj—bj) =0, >) (aj—b;) =0, and >) (a; —b}) = 0. 
j=l j=l j=l 0 
Remark 7.7. (Kannappan [462]). Suppose f, g,4,k : F — F, where F is a field 
with more than three elements and char F # 2, satisfy the generalized divided 
difference equation 


f(x) - 8) 
—y 


Xx 


(7.19a) =h(x)+k(y), x#y, x,yER. 


Then the general solution of (7.19a) is given by 


(7.19b) Be = ax* + (c1 +¢2)x +e = g(x), 


h(x) =ax+c, k(x) =ax+c2, 


where a, Cc, Cj, C2 are constants. 
The proof follows as in Theorem 7.6, or see [462]. 


Remark 7.8. For fields of characteristic 2, Remark 7.7 is not true. There are 16 sets 
of solutions that F = Z2. Some of the solutions that are not of the form (7.19b) are: 
f@=x, ga) =x4+1, h@) =x, k(x) =x4+1; f&) =0, g@)=1, h@) = 
x, k(x) =x; and f(x) = 0, ga) =x4+1, h(x) =1, andk(x) =x. 


Remark 7.9, [462]. When F has three elements in Remark 7.7, say F = Z3, the 
general solution of (7.19a) is not of the form (7.19b). As a matter of fact, the solution 
is given by f (x) = ajx?+(an+d)x+b, g(x) = bix*+(bo+c)x +a, h(x) = ayxt+ 
a2, k(x) = bix + bo, for x 4 0, where a = f(0), b= g(0), c=h(O), d=k(0), 
with aj — bj =b—aandd+c=a2+bp. 
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Indeed, from (7.19a) we have 
y = Oyielding f(x) = b+ (h(x) + d)x, x 40, 
x =0 yielding g(y) =a+ (k(y)+c)y, y #0, 


and these f and g in (7.19a) yield 
x(k(vy) —d)+a=y(h(x)-c) +b, x,y#0,x Ay, x,y € Zs. 
By fixing x ¢ 0 and separately fixing y ¢ 0 in the above, we get 
k(y) = biy + bo, h(x) = aix + a2 


and x(bjy + by —d) +a = y(ajx + a2 —c) +b. Since Z;3 has only three elements, 
0, 1,2, and x, y are different from 0 and x ¥# y, the possibilities are x = 1, y = 2 
or x = 2, y = 1, so that we obtain 2b; + bo —d+a = 2(a, + a2 —c) +5 and 
2(bj +b2 —d)+a = 2a, +.a2 —c+b. This leads to the conditions a; — bj = b—a 
andd+c=a2+ bo. 

Thus it is possible to have solutions of (7.19a) with f ~ g when F = Zs. 


We will later need the following result, which is slightly different from Theo- 
rem 7.6. 


Theorem 7.9a. Functions f, g : R > R satisfy the functional equation 


gig) FOL gy 90), forsyER AY Hy #0, 


if, and only if, f is quadratic; that is, f (x) = ax* + bx +c and g(x) = ax + A 
(linear), where a, b,c are real constants. 


Proof. Rewrite (7.19c) as 


(7.21) f(x) — fQ) = & — y\(g@) + gQ)) forx # y, x,y #0, x,yeER. 


Let f, g be solutions of (7.19c). First replacing y by —y in (7.21) and subtracting 
the resultant equation, we have 


fy) — Fy) = 2yg() + & + y)g(-y)— @— yg), x Fy, x,y FO. 
Then set y = —x in (7.21) to get 
f (x) — f(—x) = 2x(g(x) + g(—x)) forx ER. 
These two equations yield 
(7.22) 2yg (x) + (x — y)g(—y) — @& + y)g(y) = 0 


forx Ay, x,y £0. 
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First replacing x by —x in the equation above and adding to it the resultant equa- 
tion, we obtain 


g(x) + g(-x) = g(y) +a(-y) =b (say). 


Substituting this value of g(—y) in (7.22) gives 


b b 
y (sm 7 >) =x (so 7 >): 


that is, g(x) = ax + g (as asserted). This value of g in (7.22) results in 


f (x) — ax? — bx = f(y)— ay’ —by =c (say). 


This proves the theorem. 


Remark 7.10. If the right side of (7.19c) is replaced by g(x) + A(y) for g,h: R- 
IR, exchanging x and y in (7.19c), we can conclude that g and h differ by a constant 
and then we can use Theorem 7.9a to solve this equation. 


7.3 Generalization 


After Theorem 7.5, (7.15) can be written as 


f@)-fO) _ f@M+fo) 


7.23 , 
(7.23) = ; 


x,yeR, x Fy. 


It is not immediately clear what the generalization of (7.23) should be for higher- 
order derivatives. The geometric interpretation of (7.23) is that the slope of the chord 
on an interval is equal to the average of the values of the derivatives at the end points, 
but this, too, offers no clues as to what the generalizations should be. Revelation 
came from an unlikely source—Peano derivatives (see [180]). 


7.3.1 First Generalization Using Derivatives 


Result 7.11. (Cross and Kannappan [180]). /f f : R — R has derivatives of order 
up to n, then the functional equation 


n—-1 k 
_ (x=y)* ¢(k) 

FO> RE FO) peer nf) 
G@— yy" ~@ +0)! 


holds for allx,y € R, x # y, if, and only if, f is a polynomial of degree (n + 1). 
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7.3.2 Second Generalization Without Using Any Regularity Condition 


Result 7.12. [180]. The functions f, g.,h : RR > R, k =0,1,...,n, satisfy the 
functional equation 


f(x) = Do — y)*ee(y) + @ — y)"AG), 
k=0 


forx,y €R, x # y, n = 1, if and only if, f is a polynomial of degree at most 
(n+ 1), 


(k) 
ey) = 2 SY) fork =0,1,....2=1, 
k”™) 
en) = pM) <e, 
Aw) = Gap Pete, 


where c is an arbitrary constant. 


7.4 Another Generalization—Divided Difference 


For distinct x, y ¢ R, f : R— R, define 


= f(y) — f@). 


Then (7.23) and (7.19) can be rewritten as 


wil. £O+ FO) 


[x,y] 2 ; 
wt = g(x) +h(y); 


that is, the left side can be regarded as the divided difference of f on the distinct 
points x and y. This gives a clue to generalization by divided difference. 
For distinct points x1, x2,...,x, in Rand f :R — R, define 


1 x2 x2--- x27! . 
[x1,%2,..-,Xn] = 2 2 =| [ai --x,)), Gf 1, 2,02,.0) 

eee ee ee ee ii 

lL xp.xg es xy : 
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(Vandermonde’s determinant), 


1x, xe cvext * fei) 
ee) x5 ++ xR? £ (x0) 


ly eo en—* FG) 


The divided difference of the function f : R — R on the distinct points 
X1,%X2,..-,X, in R is defined by (Kannappan and Crstici [483]) 


[xi,%2; sey Xn fi) 
[x1, X2, we Xn] 


(7.24) 


The divided difference denoted by f[x1, x2,..., Xn] and defined recursively by 


flxo] = fo), 
(xo) — f@1) 
jin. 
X0 — X1 
Sf lxo, x1, cee »Xn—1] = Palace. see Xn] 
X0 — Xn 
in Bailey [84], Bailey and Fix [85], and Sahoo and Riedel [722] is the same as the 
divided difference defined in (7.24) [483]. 
Now characterization of polynomials of degree n can be obtained by using the 
divided difference. For example, consider the functional equations 


eer) 


S(x0,*1,---5Xn] = 


[Hisccev tnt Ff] 

(7.25) OS = g(x) +++ + 8 Grn), 
[ise vig Xn] 
Bis ccias Bas 

(7.26) ae = g(t +X2+++++Xn), 
[X1,..-5Xn] 

where f, g : R— Rand x1,..., Xp» are distinct reals. 


It is easy to see that 


(7.24) ijsenstar tS) ogy SG = ae 


[x1,-.-5%n] 4 I] @j - xi)’ 
SAL 
Now we will prove some results and state some others (see [84, 722, 182, 462, 


483, 511)). 


Theorem 7.13. Suppose f, g : R — R satisfy the functional equation (7.25), where 
X1,%X2,...,Xn are distinct points. Then, and only then, f is a polynomial of degree 
at most n and g is linear; that is, a polynomial of degree 1. 


n—2 
Proof. If f is a solution of (7.25), so is f(x) + © axx*. So, without loss of gener- 
k=0 


ality, choose distinct points yj, y2,..., Yn—2 different from 0 such that f(0) = 0 = 
fO1) =-:: = fOn-z). Clearly there are plenty of choices for yj, ..., yn—2. 
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Letting (x,0, y, y1,.--, Yn—3) for (41, x2, ..., Xn) in (7.25), we obtain 


_ f@) _ f(y) 
X(y —x)Q1 — x) +++ On-3 —*) & — y)yO1— y) +++ On-3 — y) 
= g(x) + g(y) +¢1, 


where c; = g(0) + g(y1) +--+: + 8Qn-3). 


Define h(x) = wot so that the equation above can be written as 


(7.27) — =hy) _ 


=g(x)+g(y)+c forx Ay, x,y £0, 


which is (7.19a) and (7.19c). Then, from Theorem 7.9a, it follows that g is linear 
and h is a polynomial of degree 2; that is, f is a polynomial of degree n. This proves 
the theorem. 


Corollary 7.13a. (Kannappan and Crstici [483]). If f, f’,..., f{™ : R > R, then 


(7.25a) D1, Xn ss Xn fT _ 1 Lf Gi) core Fo Ge) | 
[x1,%2,-.-,Xn] n! 
or all distinct x,,x2,...,Xn in R if, and only if, f is a polynomial of degree at 
y poly § 


most n. 
Take g = a fe) in Theorem 7.13. Note that Theorem 7.13 was proved without 
any regularity condition on f. 


Corollary 7.13b. [483]. [f f, g1,..., 8n : R — R satisfy the functional equation 


[tam fl oy 
(7.25b) =D ge (Xe) 
[x1,.--5Xn] k=1 
for all distinct x1, ...,Xn in R, then f is a polynomial of degree at most n and the 


g's are linear and differ from each other by constants. 


Proof. By following the steps of the proof of Theorem 7.13, we arrive at 


(7.27b) AG nO) = gi(x) + g2(y) + constant. 


x—y 
Now, by using Remark 7.10, we obtain the result sought. 


Result 7.14. (Crstici and Neagu [182], Kannappan and Sahoo [511]). Jf func- 
tions f,g : R — R satisfy the functional equation (7.26) for a set of n distinct 
points X1,X2,...,Xn in R, then f is a polynomial of degree at most n and g is 
linear. 
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7.5 Generalization of Divided Difference 


For distinct points x1, ...,X, in R and functions fi,..., fr : R — R, we define 


Lxy sx? fir) 
La fal = |! 22°72 22) 


[iy X2pe.0 5X At fis 


and the generalized divided difference [462] by 


(7.24a) [et 25 +1195 fly fa ++ +s Jad 
[%1;.x2; es Xn] 
which is a generalization of (7.24). 
It is easy to check that the generalized divided difference (7.24a) is given by 


n 


ee eae Sk (xx) 


[x1,.-.5Xn] 7 (xe — X1) +++ Xe Xe-1) OR X41) ++ OR Xn)” 


The following results hold [462], which are generalizations to Theorem 7.13 and 
Result 7.14. 
Result 7.15. (Kannappan [462]). Let fi, g,: R-> R@,k =1,2,...,n) satisfy the 
functional equation 


n 


[V1.5 tn Sle sais Fal _ 
Eee = 2, g(a) 


for distinct x1, X2,...,Xn in R. Then all fj’s are equal, f\ is a polynomial of degree 
at most n, and the g;’s are linear functions differing by constants. 
Result 7.16. [462]. Suppose fi,g : R—> R@=1,2,...,n) satisfy the functional 
equation 
[Misessts tee Tignes fal 
[Xts0635 XH] 
Then all f;’s are equal, f\ is a polynomial of degree at most n, and g is linear. 


= g(x, +--+ +X). 


Remark 7.17. Note that no regularity condition was used in the two theorems above. 


7.6 Problem of W. Rudin and a Generalization 


In the American Mathematical Monthly, W. Rudin [716] posed the following prob- 
lem: Find all differentiable functions f : R — R that satisfy 
(7.28) LOI) _ eee), ody HER, 

x—y 
where s and ¢ are given real numbers. In this connection, we prove the following 
theorem (see also E3338 [235] and Sahoo and Riedel [722]). 
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Theorem 7.18. The function f : R — R satisfies equation (7.28) for all x,y € 
R, x # y, ands and t are given reals, if, and only if, 


ax* + bx +c, ifs =t=4, 


bx +c, otherwise, 


(7.29) f@)= | 


where a,b, and c are real constants. 


Proof. First we rewrite (7.28) as 


(7.30) f(«)- f0) = @ — y) f' (sx + ty), x#Y, x,yeER. 


It is easy to see that f has derivatives of all orders. (We do not really need this 
fact.) If f is a solution of (7.30), so is f(x) — f (0). So, without loss of generality, 
assume f (0) = 0. Now we dispose of some special cases of s and f. 

With s = 0 =f, (7.30) becomes f(x) — xf’(0) = f(y) — yf’(0), implying that 
f is linear. 

Next assume s = 0, t 4 0 (s 4 0, tf = Ois similar). Then (7.30) is f(x) — 
fy) = « — y)f'(ty). Setting y = 0 (or interchanging x and y) shows that f is 
linear. Now, we assume thats #0, t £0. 

Letting y = 0 and x = 0 separately in (7.30), we have f(x) =xf'(sx), x 40 
and f(y) = yf’ (ty), y 4 0, so that (7.30) can be written as 


(7.31) xf'(sx) — yf'(ty) =@—y) f(x tty), x Ay, x,y #0. 

Replace x by = and y by 2 in (7.31) to get 

732) ='@)-2 fF) =(]-2) ety, xAy, ty £0. 
Ss t S t 


Interchanging x and y in the equation above and subtracting the resultant, we obtain 


Ci, 11 
(- + 7) xf’) — yf'O) = (- + 7) (x—y)f'@ty), xAy, x,y £0. 
Suppose + + + = 0; that is, f = —s. Then (7.32) becomes 


xfa)tyfM=e@+yf'aty), xAy, xy #0. 


Defining A(x) = xf’(x) for x € R, we see that A is additive and A(x) = cx 
(here we use that f’ is continuous), so that f’(x) = c; that is, f is linear. 
Finally, we consider the case + + + # 0. Then (7.33) yields 


(7.34) xf'@®)—yfO=@-y fatty), x#y, x,y #0. 


Now we will show that f is quadratic (that is, a polynomial of degree 2). This 
can be achieved with and without using that f has higher derivatives. 

First assume that f has higher derivatives, say up to 3. Differentiating (7.34) first 
with respect to x and then with respect to y, we get 
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Ge — yf" + y) =0; 

that is, f(x) = O or f is quadratic. This f in (7.29) shows that s = t = 5: 

Now we prove this result without assuming higher derivatives of f. First y = —x 
in (7.34) gives 

f'@) + f(x) =2f'(0) for all x. 
Then change y to —y in (7.34) and subtract the resultant to obtain 
a+y)f/@-y)-@-ywf@+y)=yG'O) + f(y) = 2yf'O) 
=[@+y)-@— yf’) 

(that is, 


+ y)(f'@ — vy) — £0) = @ —y)F'@ +y)) — FO), 
showing thereby that f’(x) = ax +b or f is quadratic. This proves the theorem. 


7.7 Generalization of Rudin’s Problem 


Result 7.19. (Kannappan, Sahoo, and Jacobson [519]). Let s and t be real para- 
meters. Functions f, g,h : R > R satisfy the functional equation 


(7.35) iD eet 0 eee, forx,yER, x Fy, 
my 
if, and only if, 
ax +b ifs =O=¢t 
ax +b ifs =0, t 40 
f(x) = {atx*+ax+b ifs=t40 
A(to ‘ 
AG) 4b ifs =—t £0 
Bx +b es er, 
ax +b ifs =O=t 
ax +b ifs =0, t #0 
g(x) = jatx*+ax+b ifs=t 40 
A(tx) Bi 
ate ifs =—-t £0 
Bx +b gra, 
arbitrary withh(0)=a ifs=O=t 
a ifs =0, t £0 
h(x) = yax+b ifs =t4#0 
4G) 4 fy ifs =-t £0, x £0 
B ea 


where A : R — R is an additive function and a,b,c, a, B are arbitrary real con- 
stants. 
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7.8 Fréchet’s Result 


Let E be a normed vector space over R. The general differences of g : E — Rare 
given by 


An) = a + x9) — 2), +«: 
Ag ottsnrtn8 ®) = Aba er,...ay lB + Xn) — 8@)I 


(called the (n + 1)th-order difference) and the equidistant differences by 


Al g(u) =g(ut+ov) —g(u),... 
A" g(u) = A"“'[g@u +0) — gWu)] 


(7.36) =>. ("Jip ew + jv) 
j=0 


(nth-order difference). 


Fréchet’s result. (Fréchet [288, 287], Amir [73]). If a continuous function g : R > 
R satisfies 


(7.36a) bs (“Joe +jv)=0 foru,veR 


j=0 


(that is, the nth-order difference of g is zero), then g is a polynomial of degree < n. 
In Theorem 7.20, we obtained the general solution of (7.36a) for n = 3 without 
assuming any regularity condition on g. 

First we consider the definition of polynomials due to Fréchet [290] and others 
(Mazur and Orlicz [627], Van der Lyn [809]). 

Let A: E — R be additive. We call A, : E” — R n-additive if Ay is a 
homomorphism in each variable; that is, if A, is additive componentwise (n = | 
additive, n = 2 biadditive). The diagonalization of the n-additive function A, is 
denoted by A*; that is, 


A*(x) =An([xX]n) forxe E 


(here [x], denotes the element of E” all components of which are equal to x). 
Evidently, Ay = A forn = Oor 1. 


Definition. (Fréchet [290], Székelyhidi [796]). A continuous function f : E > R 
will be called a polynomial if and only if f has a representation as the sum of diago- 
nalizations of multiadditive functions from E into R; that is, f has a representation 


(7.37) f=> AD 
k=0 
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where n is a nonnegative integer and A, : E* > R is a k-additive function (k = 
0, 1,...,2). (We say that f is a polynomial of degree at most n.) 

Now we turn our attention to (7.36a). First note that the case n = 2 reduces to 
the Jensen equation (J) and that g(x) = d + A(x), where A is additive (cf. (7.37)); 
that is, f is a polynomial of degree at most 1 (with n = 1). 

For n = 3, (7.36a) yields 


(7.38) g(u + 3v) — 3g(u+2v)+3g(u+v)—g(u)=0 foru,v ER. 


Now we prove the following theorem. 


Theorem 7.20. (Kannappan [470]). Suppose g : E — R has all its third-order 
differences zero; that is, g satisfies 


(7.38A) g(u + 3v) — 3g(u+ 20) + 3g(u+v) — g(u) =0 
foru,v € E. Then g has the form 
(7.39) glu) =d+Atu)+qtu)=d+Atu)+ Buu,u) forue E, 


where d is a constant, A is additive, q satisfies (Q), and B is symmetric and biaddi- 
tive; that is, f is a polynomial of degree at most 2 (see (7.37)). 


Proof. Replace u by u — v in (7.38A) to get 

(7.40) g(u+2v) —3g(u+v)+3g(u)—-—g(u—v)=0 foru,ve E. 

First, put uv = 0 in (7.40) to have 

(7.41) g(2v) = 3g(v) + g(—v) —3g(0) forue E. 

Second, interchange u and v in (7.40) to obtain 

(7.42) g(Qu+v) =3gtu+v)+g(v—u)—3g(v) foru,v€ E. 

Third, replace u by 2u in (7.40) to get 

(7.43) g(2u + 20) = 3g(2u+ 0) + g(Qu —v) —3g(2u) foru,v € E. 
Using (7.42) and (7.41) in (7.43), we have 


2g(u + v) + g(u—v) + g(v — u) = 3g(u) 


(7.44) 
+ g(—u) +3g(v0) + g(—v) —4g(0) foru,v € E. 


Replacing v by —v in (7.44) and subtracting the resultant equation from (7.44), 
we get 


(J) J(vo+tu)+J(o—u)=2J(v) foru,vnEeE 


(which is Jensen), where J(x) = g(x) — g(—x) forx € E. 
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Since J is odd, /(0) = 0 and 
(7.45) J(x) = g(x) — g(-x) =A(x) forxe E, 


where A : E — Ris additive. 
Substituting of (7.45) into (7.44) yields 
2g(u+v)+2g(u—v) —A(tu—vd) 
= 4g(u) — A(u) + 4g(v) — A(v) — 4g(0)  foru,ov € E; 
that is, 
qtu+v)+qu—v)=2q(tu)+2q(v) foru,v € E, 
where 


t} 
qu) = glu) — zAw) —g(0) foruecE, 


meaning gq satisfies (Q). Now (7.39) follows from Theorem 4.1. This completes the 
proof. 


Remark 7.21. If, in Theorem 7.20, E = R, and g is continuous, then f is a poly- 
nomial of degree at most 2 (see Fréchet [288, 287], Amir [73]). In the general case, 
compare (7.37) forn = 2. 


The following most general result is true. 


Result 7.22. Suppose g : E — Rhas its (n + 1)th-order difference be zero; that is, 
g satisfies (7.36a) for u,v € E. Then g is given by (7.37), where Ax are symmetric, 
k-additive functions (for k = 0 to n); that is, g is a “polynomial” of degree at most 
n. In the special case E = R, g is continuous and satisfies (7.36a). Then g is a 
polynomial of degree at most n. 


The equation (7.36a) is known as the Fréchet equation. For proof, see [796, 
Theorem 9.1] or use [53, p. 66, 4.2.3, etc.]. 


7.9 Polynomials in Several Variables 


For functions of several real variables, conditions similar to (7.36a) can be given. 
Let C(R”) denote the set of all complex-valued functions defined on R”, Dj = = 
For f € C(R"), let Ain f(x) = f(x + he;) — f(x) for x € R",h real, where 
{e1,€2,..-,€n} denotes the standard basis for R”. Characterization of polynomi- 
als of several variables can be given by way of translation- and dilation-invariant 
subspaces. Fora € R", t € R, f € C(R”), let 7, f and J; f be defined by 
Ta f (x) = f(x —a) and I, f (x) = f (tx) for all x € IR”. Let X ¢ denote the subspace 
of C(R") spanned by all translations T, f (fora € R”) and T, f (fort € R). Let R; 
denote the subspace of C(R”) spanned by all translations T, f of f and all orthog- 
onal transformations 0, f : x — f(P,x), where P is any orthogonal real matrix. 
The following theorems are true (see Laird and McCann [595]). 
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Result 7.23. Let f € C(R"). The following conditions are equivalent: 


(a) f is a polynomial. 
(b) For some positive integer m, Di} f =O /for j = 1 ton. 


(c) For some positive integerm, Di”, f =0 for all realh andi = \ ton. 


Result 7.23A. Let f € C(R”). A necessary and sufficient condition that f be a 
polynomial is that X ; be finite-dimensional. 


Result 7.23B. For f € C(R"), a necessary and sufficient condition that f be a 
polynomial is that Rj; be finite-dimensional. 


7.10 Quadratic Polynomials in Two Variables 


By the mean value theorem for functions of two variables f : R? —> R7 (see Courant 
[176, p. 80]), it is well known that 


(7.46) fx +h,y+k)— f@,y) =Afr(x + Oh, y + Ok) + kfy(x + 6h), y + Ok 


holds, where 0 <@ <1, fy, and f, and f) are partial derivatives. 


Setting x +h =u, y+k =v, 0= 5 in (7.46), one obtains the functional 
differential equation 
(7.47) 


Fue) fey) = uf ( =) o-n(S a). 


2" 2 2 


If we replace the partial derivative by unknown functions g and h, (7.47) 
becomes 


(7.48) fu.) — f(x,y) = (u—x)gx tu, y +0) + (0 — yh +u, y +d) 


forx,y,u,v ER, f,g,h:R* SR. 
The following results are true [513]. 


Result 7.24. Ifa quadratic polynomial f (x, y) = ax*+bx+cy?+dy+axy+f, with 
a £0, c £0, is a solution of the functional equation (7.48) assuming x, y,h,k € R 
with h* +k? # 0, then@ = 5. Conversely, if a function f satisfies (7.48) with 8 = 5, 
then the only solution is a quadratic polynomial. 


Result 7.25. Suppose f,g,h : R? — R satisfy (7.48) for all x, y,u,v € R with 
(u—x)°+(v—y)? 40. Then f is of the form f (x, y) = ax*+bx+cy*+dy+a+ 
B(x, y), where B : R? —> R is biadditive and a, b,c, d, a are arbitrary constants. 
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7.11 Functional Equations on Groups 


In this section, first we consider two functional equations treated in Sablik [719], 


(7.49) f@)-g)=hat+y)a-y), xFy, 
(7.50) f(x) — gy) = k(x) + 0) @ — y), 


for x,y € G, where f,g : G > H, h,k,€: G — Hom(G, H), and G, H are 
Abelian groups. These equations on R were treated in [359, 31, 487, 462], etc.; cf. 
(7.10), (7.11), (7.15), (7.19a), ete. 

Equations (7.49) and (7.50) are solved in Theorems 7.26 and 7.27, where x, y 
are in Abelian groups. It is necessary to explain what the multiplication on the right 
sides means. Note that, in the real case, h(x) and k(x) + €(y) may be identified with 
linear mappings “t — h(x)t” and “t > (k(x) + €(y))r’”. That is what is required of 
h(x) and (k(x) + €(y)) for x, y € G; that is, we assume h(x) and k(x) + €(y) are 
homomorphisms from G to H for x, y € G. 

First we treat (7.49). 


Theorem 7.26. [719]. Let G(+) and H(+) be Abelian, 2-divisible groups. The 
functions f,g : G > H, h: G > Hom(G, H) satisfy the functional equation 
(7.49) for allx,y € G, x # y, if and only if 


f(x) = g(x) =b + AQ) + Bix, x), 


Pen) Koj= AA B.S, 


for x € G, where bis aconstantin H, A: G — H is additive,andB:GxG— H 
is a symmetric, biadditive function. 


Proof. Interchanging x and y in (7.49), we get 


fy) -—g@) =h& + y)\ -x) = -(f() - gQ)); 


that is, f(x) — g(x) = —(f() — g()) = c (say). Then 2c = 0, soc = 0 since H 
is 2-divisible (this is the only place we use 2-divisibility of H). Thus f(x) = g(x). 
Set y = 0 in (7.49) to have 


(7.52) f(x) =g(0)+h@)x forx €G, x £0. 

Putting (7.52) into (7.49), we obtain 

(7.53) h(x)x —h(y)y =h(xt+y)a—y) forx,yeG,x#y, x,y £0. 
Put y = —x in (7.53) to have (note that h(—x) and h(0) are homomorphisms) 


(7.54) (h(x) + h(—x))(x) = 2h(O)(x) forx eG, x £0. 
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Replace y by —y in (7.53) and subtract the resultant from (7.53) to get 


h(x —y)ax+y)-h(e + y)@— y) = (hy) +h(-y))Q) 
=2h(0)(y) by (7.54) 
=hO)x+y—-(@—-y)] 


or 


[h(x — y) — AO) @ + y) = [A + y) —hON@ — y). 


Letting x —y =u, x+y =v foru,v € G, v £ 0 (possible since G is 2-divisible), 
we have 


(7.55) (h(u) — h(O))(v) = (A(v) — h(O))u, u,v € G, v £0. 
Define B: G x G > H by 
(7.56) Btu, v) = (A(u) —hA(O))(0). 


Then B is additive in the second variable and (7.55) shows that B is symmetric. 
Thus B is a symmetric biadditive function. Equation (7.56) gives h(u)(v) = A(v) + 
B(u,v), where A = h(O) € Hom(G, H). Equation (7.52) now gives (7.51). This 
proves the theorem. 


Now we consider (7.50). 


Theorem 7.27. [719]. Suppose f,g : G — H,k,€:G — Hom(G, A), where 
G(+) is an Abelian group and H is a 2-divisible Abelian group. Then f,g,k, € 
satisfy equation (7.50) if and only if 


f(&) = g(x) = b + 2A2(x) + 5 Bala, x), 
(7.351) k(x) = (A1 + A2) + Bix, -) + Bo, :), 

€(x) = (A2 — Ai) + Box, -) — Bix, -), 
where Aj, Az € Hom(G,H) : Bz : G x G — H is symmetric and biadditive, 
B,:G x G => FH is skew-symmetric and biadditive, and b € H is a constant. 


Proof. Suppose f, g, k, € satisfy (7.50). Then x = y in (7.50) gives f(x) = g(x) for 
xeEG. 

First y = 0 in (7.50) yields f(x) = f(O) + (k(x) + €(0))(x), x € G, and then 
x = 0in (7.50) yields f(y) = f (0) + (€(y) + k())(Q) for y € G. 

From the last two equations, first we get 


(7.58) (K(x) — €@))(x) = (KO) — €0))@) = Ai), say, 
where A; € Hom(G, A); that is, A; is additive and then we have 

1 
(7.59) f@) = f()+ 7k) + €(x) + A2)(x), 


where k(0) + €(0) = Az € Hom(G, A). 
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Interchange x and y in (7.50) to have 
(7.60) f(y) — F@) = K&O) + £@)O —x)  forx,y €G. 
Adding (7.50) and (7.60) and using (7.58), we have 
(k(x) — €@))) + KO) — €0))@) = Aix) + Ally) forx, y € G; 


that is, 
(k(x) — €@) — AY) = —&O) — E(y) — Ar) (@). 
Define B} : G x G > H by 


Bix, y) = (k@) — €@%) —At)Q)  forx,y eG. 
Then B, is a skew-symmetric, biadditive function and 
(7.61) (k(x) — €@))) = 410) + Bi, y) forx,y eG. 
Subtracting (7.60) from (7.50), we obtain 
2(f &) — FO) = &@) + €@) + KY) + EO) — y), 
which by the substitution of f(x) from (7.59) results in 
(k(x) + €() — A2)(y) = K&(y) + €(y) — Az)@)  forx,y EG. 
Defining B2 : G x G > H by 
Bo(x, y) = (kK) + E(x) — A2)Q), 
we see that Bz is symmetric and biadditive and 
(7.62) (k(x) + €@))() = Ary) + Box, yy) forx,y EG. 
From (7.61), (7.62), and (7.59), follow (7.57). This completes the proof. 


7.12 Rudin’s Problem on Groups 


Now we treat the functional equation 


(7.63) fx) — f(y) =h(sx +ty)\a—y) forx,yeG, x Fy, 


where f : G > F (Ga divisible Abelian group, F a field of characteristic zero), 
h: G — Hom(G, F) (an element A € Hom(G, F) means A : G — F is such that 
A(x +y) = A(x) + AQ) for x, y € G). Recall that x € G is n-divisible if there is a 
y € Gwith ny = x and G is divisible when each x in G is divisible by every n > 0 
[713]. In order to make sense in the argument sx + ty in G, we assume that s and 
t are integers. We prove the following theorem (see Theorem 7.18 and Result 7.19, 
and refer to E3338 [235], Sahoo and Riedel [722], and Kac [408]). 
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Theorem 7.28. The functions f : G > F, h : G — Hom(G, F) satisfy the 
functional equation (7.63) forx, y € G, x A y, ands and t integers if and only if 


arbitrary forx £0, 
A forx =0, 
f(x) =A) +b, h(x) =A, forx €G,s 40, t=0, s=0, t 40, 
f(x) =A) + Bax, x) +b, h(sx) =A+B(x,:), x €G, s=t 40, 
f(x) =AQ) +b, h(sx)(x) = A(x), forx eG, t=-s £0, 

f(x) = A(x) +b, h(x) =A, forx eG, s° #1? £0, 


xéE€G,s=0=t, 


f(x) = A(x) +b, h(x) = | 


where A € Hom(G, F), B: G x G > F is biadditive and symmetric, and b € F 
is a constant. 


Proof. It is easy to verify that f and h given above are solutions of (7.63). We prove 
the converse in three cases according to whether s and ¢ are zeros or not. 


Case 1. Suppose s = 0 =f. 
Then (7.63) can be rewritten as 


F(x) —hO@) = fO)—hOG) forx,yeG, x Fy. 


Now y = O yields f(x) = A(x) + b, where b = f (0) and h(O) = A € Hom(G, F) 
for x ~ 0. But this equation holds for x = 0 also. h(x) is arbitrary for x 4 0. 


Case 2. Suppose s = 0, t £0. 
Equation (7.63) becomes 


f(x)-—fO)=hGy)@—y) forx,yeG, x Fy. 
First y = 0 yields 

f(x) =h(O)(x) + fO) =A)+5b  forallx € G, 
where A is additive. Then x = 0 gives 


f(y) =h(ty)Q) + FO) forall y € G. 


Thus h(ty) = A for all y; that is, h(x) = A for all x € G. Here we use that G is 
divisible. 
Case 3. Suppose s 4 0, t 4 0. This case is subdivided into three cases according to 
whether s = f or not. 

First of all, x = 0 and y = 0 separately in (7.63) give 


f(x) =h(sx)@)+b forallx €G, 
f(y) =h(ty)Q) +5 forall y € G, 
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where b = f (0) so that (7.63) becomes 
(7.64) h(sx)(x) — h(ty)(y) = h(sx +ty)\(«—y) forx Ay, x,y EG. 


Subcase 1. Let s = ¢. 
Then (7.64) becomes 


h(sx)(x) —h(sy)\y) =h@+y)@—y) forx,yeG, x Fy. 
Define g: G—> F, H : G > Hom(G, F) by 
g(x) =h(sx)(x) and A(x)=hA(sx) forx EG, 
so that we can rewrite (7.64) as 
ga)-sQ)=H@t+y)a-y) forx,yeG x Fy. 
Then, from Theorem 7.26 (refer to [722]), 


h(sx)(x) = g(x) =a0 + A(x) + BG, x), 
h(sx) = H(x)=A+B(x,-) forx eG, 
where A € Hom(G, F) and B : G x G > F is a symmetric, biadditive function, 


ao € F. 
It is easy to see that ag = 0 and 


f(x) =h(sx)x) +b= A(x) + BO,x) +b forx eG. 


Subcase 2. Let s = —t. 
Substitute ¢ = —s in (7.64) to get 


h(sx)(x) — h(—sy)(y) =h(s@ — y))\(@—y) forx,yeG, x Fy. 
Changing y to —y, we have 
A(sx)Qx) +hy)\y) =ho@+y)@t+y) forx,yeG, x Fy. 
Define A: G > F by 
A(x) =hA(sx)(x) forx € G. 
Then A is additive on G and 


f(x) =h(sx)(x)+b=A(x)+b forxeG. 


Subcase 3. Suppose s* 4 t. 
Replace x by tx and y by sy in (7.64) to have 


h(stx)(tx) — h(sty)(sy) =h(st(x + y))(tx —sy) forx,yeG,xFy. 
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Interchanging x and y in the equation above and adding the resultant equation, we 
obtain 


(t — s)[h(stx)(x) + h(sty)(y)] = t — s)AGst@ + y)@+t+y); 
that is, 
h(stx)(x) + A(sty)(y) =A(sta+y))(x+y) forx,yEeG,xFy. 


Here we make use of F as a field with characteristic zero, s — t #£ 0, and h(sx) isa 
homomorphism. 
Define A: G > F by 


A(x) =A(stx)(x) forx € G. 
Then A is a homomorphism and 
f(x) =h(sx)@) +b forallx € G. 
Now 
f (tx) = h(stx)(tx) +b = tA(x) +b 
=A(tx)+b forx €G, 


so that 


fQ)=AQ)+5b forally eG. 
Here we use that G is divisible. This f in (7.63) yields 
A(Qx—y)=hA(sx+ty)«—y) forx,yeG,xFy. 
Replacing x by x + y, we get 
A(x) =h(sx +(s+t)y)(x) forx,y €G. 
Let u € G. Then there is a y € G with (s + t)y = u (also use s + ¢t £0) and 
A(x) = h(sx + u)(x). 


Given x,v € G, there is au € G such that sx + u =v and A(x) = h(v)(x) so that 
h(v) = A for all v € G. This completes the proof. 


7.13 Generalization 


We consider a functional equation that is a generalization of (7.5), (7.10), (7.19), and 
(7.19a) and arises in connection with Simpson’s Rule [722], namely 


(7.65) f@)—- gy) =@-—y)h@ +y)+ka)+€Qy)) forx,y eR, 


where f, g,4,k, €: R — Rand prove the following theorem (see [722]). 


7.13 Generalization 


Theorem 7.29. The functions f, g,h,k,€ : R — R satisfy the functional equation 


(7.65) for x, y € R if and only if 


fx) = g@) = b1x4 + box3 + b3x* + dx +b, 


Ges h(x) = zbix? + xbox? + A(x) +1, 
: k(x) = $b x3 + sbox? + b3x — A(x) + c2, 
(x) = $b x3 + sbox* + b3x — A(x) + c3. 


where A : R — R satisfies (A) and b, bj, b2, b3, ¢1, c2,¢3 are constants with 


d=c2+034+¢1. 


Proof. This result appears in Sahoo and Riedel [722] with a very lengthy proof. Here 
we present a simple, direct proof. Let f, g,h,k,€ be solutions of (7.65). Setting 


y = x in (7.65), we have f(x) = g(x) forx € G. 


If f (x) is a solution of (7.65), so is f(x) — f (0). So let us assume that f (0) = 0. 


Interchanging x and y in (7.65) and adding the resultant to (7.65), we have 
(x — y)(kK(@x) — €@) + £0) —kQ)) = 0 forx,y ER; 

that is, 
(7.66) k(x) — €(x) = k(y) — €0) =c =k(O)— C(O) forx #y. 
Putting this value of k(x) in (7.65), we get 
(7.67) f@)-f)=@-—yA@t+y)+€@)+€)+e) forx,y ER. 
Set y = 0 in (7.67) to get 
(7.68) fx) =x(h(x) + €@)+k(0)) forx ER. 
Substituting for f (x) from (7.68) into (7.67), we obtain 


(x — y)h& + y) + x(C(y) — €0)) — yE@) — €O)) = xh(x) — yh(y) 
forx,y ER, 


which, after adding xh(y) — yh(x) to both sides, becomes 

(7.69) (x — y)((x + y) —h@) — AQ) = yo) -— xO), x,y ER, 
where ¢ : R > Ris given by 

(7.70) d(x) = C(x) +h(x) — €(0)_ forx ER. 

Let y = —x in (7.69) to have 


(7.71) 2(h(0) — h(x) — h(—x)) = —(@(*) + (—x)) forx ER. 
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Changing y into —y in (7.69) and adding the resultant equation to (7.69) and utilizing 
(7.71), we obtain 


(x + y)A@ — y)— A) +h(y) — A(0)) 


(7.72) 
+ (a — y)\(A(x + y) —h(@) +h(—y) —A(0)) =0, x,yER. 


Exchange x and y in (7.72) and add the resultant to (7.72) to have 


(x + y)(AX — y) +h(y — x) — 2h(0)) = & — y) (AG) + A(—-x) — h(y) — h(-y)); 
that is, 

(7.73) (x + y)t( — y)=@—y)¢@)—-tQ)) forx,yeR, 

where ¢ : R — Ris defined by 

(7.74) t(x) = h(x) +h(—x) —2h(0) forx ER. 


Note that (0) = 0 and ¢ is even. 
Replace y by —y in (7.73) to get 


(7.75) (@-—yit@ty)=@+y)¢@)-tQ)) forx,y eR. 

From (7.73) and (7.75), we obtain ((7.73) x (x + y) — (7.75) x (x — y)) 
@+y/t@-y)=@-y)t@+y), 

so that 

(7.76) t(x) =box* forx ER, 


where bo is a constant. 
Change x to —x in (7.72) and add the resultant to (7.72) to have 


(7.77) 2xs(x)— 2ys(y)=@+y)s@—y)+@—y)sa+y) forx,y ER, 
where s : R —> Ris defined by 
(7.78) s(x) =h(x)—h(—x) forx ER. 


Note that s(0) = O and s is odd. If we determine s, then (7.74) and (7.76) give h. 
Using this in (7.69) will yield ¢ and hence @ in (7.70). Then we can obtain f from 
(7.68). We now proceed to find s. 

Applying (7.77) twice, we get 


2xs(x) = 2(x + y)s(x + y) + 2x + y)s(—y) — ys(2x + y) 
= 2(x — y)s(x — y) + 2x — y)s(y) + ys(2x — y), 
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which (by using (7.77) again and that s is odd) results in 


Axs(y) + ysQx — y) + ysQx + y) =2 + y)s@ + y) —2(% — y)s(x -— y) 
= 2xs(2y)+2ysQ2x) forx,yeER. 


Replace 2x by x in the equation above to obtain 
(7.79) y(sx + y) +s(x — y)) = x(s@y) — 28(y)) + 2ys(Qx) forx,y ER. 
Interchanging x and y in (7.79), we have 
(7.80) x(s(e+y)— s(x — y)) = y(s(2x) — 2s(x)) + 2xs(y) forx,y ER. 
Now (7.79) x x + (7.80) x y yields 

2xy(s(x + y) — s(x) — 5(9)) = x2(s(2y) — 25(y)) + y2(82x) — 25(x)): 
that is, 
(7.81) 9 s(x + y)= s(x) +s5(y)+xpQ)+ yp(x) forx,y ER, x,y £0, 


where p : R — Ris defined by 


SOx) 2s) forx eR, x #0 
P(x) = . 
0, forx = 0. 


Note that p is even and that (7.81) holds for all x, y € R. Applying (7.81) four times, 
we get 


s(x +yt+z)=s(&%+y)+2) 
= s(x) +s(y) + s(z) +xp(y) + yp) + (& + y) pz) + zp + y) 
s(x + (y +2)) 


= s(x) + s(y) + 8(z) + yp(z) + zp) + xp +2) +0 +2) P@), 


which results in 
(7.82) x(p(y +z) — p(y) — pP@)) = z(P@+y)— p(x) — pQ))  forx,y,zeER. 
Changing z to —z in (7.82) and adding the resultant to (7.82), we have 


(Q) P(y +2) + ply —z) =2pQ) +2p(z) fory,z eR. 


Thus p(x) = B(x, x), where B : R x R > R is symmetric and biadditive (see 
Theorem 4.1). 
Putting this value of p(x) in (7.82) yields 


xB(y,z)=zB(x,y) forx,y,zER. 
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With z = y, we get x B(y, y) = yB(x, y) for x, y € R. Exchange x and y to obtain 
yB(x,x) = x B(x, y), so that finally we have x*B(y, y) = y*B(x, x) and 


(7.83) p(x) = Bx, x) =b\x? forx ER, 


where b, is a constant. 
From (7.81) and (7.83) there results 


s(x + y) =s(x) +5(y) + bixy? + dix?y 


= s(x) +509) + xbie +9)? 23 ys 


that is, 
A(x +y)=A(x)+AQ)) forx,y ER, 
where 
A(x) = s(x) — six’ forx ER. 
Thus 
(7.84) h(x) — h(—x) = s(x) = sx + A(x) forx eR. 


Now (7.84), (7.76), and (7.74) yield 
1 ae 1. 1 
(7.85) h(x) = —— + ee + 5A@) +c, forx eR. 
h(x) from (7.85) in (7.69) gives 
1 
(@-y) (vory + sbixy@ + y) — <i) = yh(x) — xA(y); 


that is, 


1 1 

x (40 shy bry” ai) =y (09 she box? <i) : 

so that 
1 

E(x) + h(x) — €0) = d(x) = she + box? +b3x+c, forx ER, 

and (using (7.85)) 
1 3, 1 2 1 

(7.86) (x)= Pe + zoe + b3x — 5A@) +c forx ER, 


where b3 is a constant. 
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From (7.86), (7.85), (7.68), and (7.66), we obtain 
1 
fQx)= shix + box3 + b3x? + dx +b, 


2 1 1 
k(x) = zhie ~ xbox” t+ b3x— AQ) +e3 forx eR. 


This proves the theorem. 


Corollary 7.30. (Sahoo and Riedel [722]). The general solutions f,s : RR > R 
satisfying the functional equation 


7.87) f@)-—fOP=@—-ys@+y)+@t+y)s@—y) forx,yeR 


are given by s(x) = bix3 + A(x), f(x) = 2b1x++2xA(x) +b, where A:R—>R 
satisfies (A) and b, by are constants. 


Proof. Setting x = y in (7.87) shows that s(0) = 0. Put x = 0 and y = 0 separately 
in (7.87) to obtain 


f(y) = ys) — ys(-y) +b, f(x) = 2x8(x) +b, 


where b = f (0). Clearly we see that s is odd and (7.87) becomes (7.77). The result 
now follows from Theorem 7.29. 


Corollary 7.31. [722]. The functions h, f : R > R satisfy the functional equation 
(7.69) (x — y)(A@ + y)— A(x) —A(y)) = yO) — XO), 
forx,y ER, if, and only if, 

h(x) = bix? + box” + A(x) +b, d(x) = 3b1x?° + 2box? + b3x +c, 


where A: R > R satisfies (A) and b, bj, b2, c are constants. 
The result follows from Theorem 7.29. 
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Nondifferentiable Functions 


Nondifferential functions, Weierstrass functions, Vander Waerden type functions, 
and generalizations are considered in this chapter. 

In classical analysis, one of the problems that has fascinated mathematicians 
since the end of the nineteenth century is ‘Does there exist a continuous function 
that is not differentiable?’ It is an interesting question. Motivated by this exciting 
question, many well-known mathematicians, starting with Weierstrass (1872) [827], 
started to work in this area to produce such a function. It is well known that the 
answer is affirmative. This chapter is devoted to listing several continuous non- 
(nowhere) differentiable functions (c.n.d.f.s). What is of interest to us and is the 
primary motive of this chapter is to show that most of the well-known examples can 
be obtained as solutions of functional equations, highlighting the functional equation 
connection. Kairies’s [412] report is an excellent survey article and a main source 
for this chapter. 

First we give many well-known examples due to Weierstrass, Takagi, van der 
Waerden, Knopp, Wiinderlich, etc., along with the associated functional equations. 
These functions can be characterized as solutions of the associated functional equa- 
tions. The properties of these functions, especially their nondifferentiability, can be 
deduced directly from the functional equations without making use of their explicit 
analytic representations. Then we give a general system of functional equations from 
which all the above can be obtained as special cases. 


8.1 Weierstrass Functions 


The first widely known example of a c.n.d.f. was given by Weierstrass in 1872, 
when he reported it to the K6nigliche Akademie der Wissenschaften in Berlin. Even 
though it appeared in his Mathematische Werke ITI [827] in 1895, it had been dis- 
cussed in a paper by du Bois-Reymond (1875) [233]. Weierstrass proved that 


fore) 
: 1 
W(x) = ye sin (o" -bax + =)> forx € R, 
n=0 
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has nowhere a finite or infinite derivative if b is an odd integer and ab > 1+ : that 
is, W isac.n.d.f. 

Several authors then gave slight improvements, and Hardy [340] in 1916 pre- 
sented the optimal result. In [214, 220] the general Weierstrass function Wa,p,o : 
R — R was introduced by 


CO 
Wa,b,o(x) = >a” sin (b" bax +0) forx eR, 
n=0 
where a € ]0, 1[,b € R,, 0 € [0, 2z[, and it was proved that W,.p,9 is ac.n.d.f. for 
ab>1,be var 
Note that W is a special case of Wu,p,9. Now comes the associated functional 
equations. W,.p,9 satisfies the functional equations 


(8.1) f(x) = af (bx) +sin(bax+6), forx ER, 


(8. 1a) t (- - ) = (-1)"af (x) + (—D* sin (zx + 0), 


forx €[0,1],.0<k<b-1,beZ}. 
Here is the functional equation characterization of Wa_,p,0. 


Result 8.1. /f f : R > R is bounded and satisfies (8.1) for some b € R+\{1}, then 
f= Wa,b,0- 
Result 8.la. /f f : R — R is bounded, of period 1, and satisfies (8.1a) for some 
be Z4\{1}, then f = Wa,p,0. 

Another c.n.d.f., given by Kairies in [414], is 


CO 
Sp(x) = > p "sin (2zp"x), 
n=0 


where p is a fixed prime. For a fixed prime p, the function S, : R > Risac.n.d.f. 
Sp satisfies many functional equations: 


f(x+I)=f), f(-x)=—f (x), forx eR. 


If f : R > Ris bounded and satisfies the equation 
1 
f(x) = —f(px)+sin2zx forx eR, 
P 


then f = Sp. 
If f : R > Ris bounded with period 1 and satisfies the equation 


k 1 k 
(= ) = 5109+ sin2n ,; Veksp—1,2 €(0,1), 
P P P 


then f = Sp. 
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Assume that f : R — R is continuous and satisfies 


f@+l=f@), frx)=—-f@), xeER, 


ie = u(p) f(a), reR, 


P pet 


where u(p) = + if p = q",n € {0,1,2,...},u(p) = 0 otherwise, and f (x) = 
sin (4 ). Then f = Sq. 


8.2 Wiinderlich’s Function 


Wiinderlich [834] defined a function W : [0,1] — R as follows. If x € [0, 1] has 
3-adic expansion 0, x; x2 x3 ..., then W(x) = 0, y1 y2 y3 ... is the 2-adic expansion, 
where x9 = 0 = yo and yg = yg_ if and only if xp = xg_}. 

Wiinderlich [835] attributed this example of a c.n.d.f. to P. Finsler. However, he 
was the first to present a system of functional equations satisfied by W and gives a 
functional equation characterization of W [834]. 

W satisfies the functional equations 


1 1 
f@)= 54 Gx) for x € o, | ; 
foy=5+r(x-3) forx e| 5): 

2 3 3°2 

1 2 
f(x) = fd —-x) forx e| 5 5) 


and 


f(x)=1-f(l-x)  forxe Bak 


If f : [0, 1] ~ Ris continuous and satisfies these equations, then f = W. 


8.3 Takagi Functions 


The Takagi function T : R > R is defined by 


— 1 
Th) = >, 54 2"x), 
n=0 
where d(y) = dist(y, Z). T was introduced in 1903 by T. Takagi [801] as an example 
of a function whose c.n.d. property can be proved much more easily than that of the 
Weierstrass function. T satisfies the functional equations [206, 414] 
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fat)=f@), xeR, 
x 1 x 
f(S)=5f@+5. 


a) 3 

1 1 1 

6 (S)=Fre-F 45, 

oe) x x+1 1 
#5) r( 2 )ee-5. 
1 1 

ro=r(Sar(S )-3 x € [0,1], 
(8.2') f() = 5 fx) +4), x eR. 


In [206], the authors showed that any two of the equations (8.2) imply the other 
two and that every solution of (8.2) must be nowhere differentiable. 

We have the following characterizations by functional equations. Let f : R > R 
be bounded and satisfy (8.2’). Then f = T. Suppose f : R > R is bounded, of 
period 1, and satisfies two of the equations in (8.2). Then f = T. 


Remark. Let g(x) := 2d(x) and denote by g) the nth iterate of g. Then we can 
generate T by iterations of g: 


— 1 
T(x) = D1 8 @). 


n=1 


8.4 van der Waerden Type Function 


The van der Waerden function w : [0, 1] > R is given by 


(8.3) w(x) = >, 5 42"x) for x € [0, 1] 
n=0 


[824, 141, 206], where d(u) is the distance from u to the nearest integer. This 
function is the simplest and best-known example of a c.n.d.f. (compare it with 
the Takagi function T). That w does not possess a finite one-sided derivative 
of any point is proved in [141]. The properties of w, notably its nondifferen- 
tiability, can be obtained directly from functional equations satisfied by w, with- 
out making use of the explicit representation of w (8.3). The proof of nondif- 
ferentiability of w is given in Darsow, Frank, and Kairies [206]. To start with, 
we derive some functional equations satisfied by w (compare them with (8.1) and 
(8.1’)). 

Since d(y) = y when0 < y < 5 andd(y)=1-—y for 4 < y < 1, we have for 
O<x <1 
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CO 
1 x 
= +1 
w(x) =2 > anti? (2" =) 


n=0 


and 


II 
& 


Thus w is a solution of the system of functional equations 


Xx 

(8.1) F(x) —2F (5) = -x, O<x<1, 

ny Wa eS, Cee te 
(8.1'a) r( : ) t(5)= x+5, O<x<1. 
It is easy to check that these two equations imply 
(8.1'b) r)— (5) - 6 (AE) =-5, o<x<1 
2 a i eis 
and 

1 

(8.1’c) w(S )-fey=-x41. O<x<1, 


and w also satisfies the symmetry condition 
(8.1'd) f(x) = fd--), exe 1. 


Remark. (Compare this with (8.1)). It can be readily checked, as mentioned earlier, 
that any two of the equations (8.1), (8.1’a), (8.1’b), (8.1’c) imply the other two. 
In what follows, we tacitly make use of the fact that any solution of a system of two 
of these equations satisfies all four of them. 
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8.5 Functional Equation Characterization of w 


Theorem 8.2. [206]. Let f : [0,1] — R be bounded and satisfy two of (8.1'), 
(8.1'a), (8.1’b), and (8.1'c). Then f = 


Proof. An additional functional equation is derived that leads to the representation 


Oe ee that f(0) = f (1) = 0, we can extend f to a function defined on R by 
(8.4) f@+)=f@). 
Rewrite (8.1'a) and (8.1’c) as 
fOx)=2f@)-2, OSxs5, 
and 
(8.5) f@Qx -1)=2f(@)-20 - x), 5 Sx<1. 
These equations can now be unified to yield the single equation 
(8.5a) f Qx) = 2f (x) — 2d(x) 


for all x € [0, 1], where d is the distance function. 

Indeed, when x € [0. | , (8.5a) is obvious. For x € E 1] , use f(x+1) = 
f (x) for all x € R on the left side of (8.5) to get (8.5a). Indeed, (8.5a) holds for all 
x €R.Letx € R, writex =n+t, withn € Z, t € [0, 1[, and use f and d with 
period | to have 


fQx) = fQn4 24) = f2r) =2f0 — WO 
(by (8.5a)) =2f(n +1) —2d(n +8) =2f (x) —2d(n), 
which is (8.5a) for x € R. 
But it is well known (see [554, pp. 81—82]) that any bounded f : R — R that 
satisfies (8.5a) for all x € R must be of the form (8.3). Indeed, iterating (8.3) gives 


m—1 


ie af" + > 5 <a(2"x) 


Qn 
n=0 


for all m € Z+; and x € R, whence the boundedness of f yields 


[o,@) 
1 
(8.3) fos >, a 2"x), xeR. 
n=0 
The extension procedure and the treatment on the larger domain R did not affect the 
original f on [0, 1]. Thus f(x) = w(x) for x € [0, 1]. 


It is remarkable that the function given in (8.3) satisfies equation (8.1’b) for all 
x € R. In fact, any function f of period 1 that satisfies (8.1'b) for x € [0, 1] nec- 
essarily satisfies (8.1’b) for all x € R. By contrast, none of the equations (8.1’), 
(8.1’a), (8.1’c) has a solution on R that is bounded. On the other hand, there do exist 
solutions of (8.1’), (8.1’a), (8.1’b), (8.1’c) on [0, 1] that are unbounded. 
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8.6 Nondifferentiability of w 


Theorem 8.3. [206]. Suppose f : [0,1] > R satisfies two of the equations (8.1'), 
(8.1’a), (8.1’b), (8.1’c). Then f is nowhere differentiable. 


Proof. We will show that f has no right derivative. Now x = 0 in (8.1) gives 
f(0) = 0, and in (8.1’a) f (+) = 1. From (8.1/a) with x = 1, we get f(1) = 0. 
Rewrite (8.1') to obtain 


(8.1'a’) fQx) =2f (x) — 2x for0 <x <5, 
which by iteration gives 

f(2"x) = 2” f(x) -—m2"x forO<x < = 
or 
(8.1'a") t(Z)asf@t me foro<x<1 


Thus f (2) = =m. 

Now 2! i f (0) 
4m— —0 

Rewrite (8.1’a) to have 


=m, showing thereby that f has no right derivative at 0. 


(8.1'a"”) f ee apy Gres ieee 
2 2’ ~ ~~ 2 


Equations (8.1’a’””) and (8.1’a””’) yield 
1 
f{xt+ am 
1 
-1 
= 1 [er ("++ 3)| 


1 m—1 1 m—1 1 'ql! 
= F(Z at spe m—Y 2 x t+5)) oy (B.1a") 


= [re 1 x)+ +5 _ 2 +4 (m _ 1) (« +4 =) (by (8. 1’ a’”)) 


= 5m— >m—1 
(8.6) 


= f@)—2e+ 5 (by (8.1'a")). 


Fix a number x; € (0, 1], and consider its nonterminating dyadic expansion 


1 
- at = bins * 
ere 1l<n, <n < »nmgeZi. 
A=1 
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ie,2) 
Let xz := >> srr, so that 
d=k 


1 
Xp = ame + x41 and 0 < xp41 < mk 
From (8.6), we obtain 
Nk 
f (XK) = f O41) + Bing Xk 

which can be applied recursively to yield 

nN} Nk 
(8.7) FGA) = Fer) + a be + aa — 22 ++ + ret). 


Next, let yx := x — xr Formula (8.7), applied to yz, gives 


Fee eee ee 


21 QNk-1 


1 1 
=2] (12 Sp) +--+ (mee) +e]. 


f1)— fFO%K) _ 
Ra = 
Te 


Thus 
(8.8) ny — 2k +2(1 - 2"! xR41) 


holds for O < uz < 1, where 


Suppose that x; is a dyadic rational. Then there is a p € Z% such that ng = 
k — p+MNp for all k = p. It follows from (8.8) that 


PN) _ ii paw) —oo ask > Ow; 

2k 
thus f cannot be left differentiable at x. Similarly, it is proved in [206] that f is not 
left differentiable at x; when x is not a dyadic rational. Thus f has no left derivative 
at any x € JO, 1]. 


Remark. The preceding argument actually proves a stronger statement than the 
conclusion of Theorem 8.3: For each x € ]0, 1], the left-hand derivative of f at x 
does not exist. Therefore, since w satisfies (8.1’d), it follows at once that both the 
left- and right-hand derivatives of w fail to exist at each x € [0, 1]. 
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8.7 Riemann’s Function 


The celebrated Riemann function r : R > R is defined by 


r(x) := > = sin(n7x). 
n 


n=1 


Its first appearance in the literature was in 1875 in P. du Bois-Reymond’s paper 
[233]. This function r had already been discussed in 1872 by K. Weierstrass in 
his Akademie lecture, which was published in 1895 [827]. In both papers, r is dis- 
cussed in connection with the c.n.d. property, but neither of the two authors give a 
clear statement about the nondifferentiability properties of r. 

In 1916, G.H. Hardy [340], in his paper on the Weierstrass c.n.d. function, was 
the first to prove substantial facts about the nondifferentiability of r, but the final 
result was first obtained in 1971 by J. Gerver in his papers [311] and [312]: r is 
differentiable exactly in the points z2,, = a eae A, pu € Z, with r’(Za,y4) = —5 
forall A, uw € Z. 

Since then, many papers on the history and the nondifferentiability, as well as 
other properties, of r have appeared. 

In the papers on r, only the following functional equations occur: 


r(x+2z)=r(x), r(—x)=-r(x), rxt+ta)= sr (a) —r(x) xeER. 


Now we consider a modified r, the function R : R > R, given by 


[ee 


R(x) :=r(Qrx)= ss = sin(2zn*x). 
n 


n=1 


The functional equations for R are simpler than those for r. R satisfies the following 
functional equations: 


f@+)=f@), 


ss F=f) 
rG)rr(F ) = 500. xeR 


(compare these with the Takagi equations). However, R cannot be characterized as 
the unique continuous solution of (8.9). A counterexample is given by 


(oe) 
1 
ci > zr sinQ2x4"x). 


n=0 


R* is ac.n.d.f. of Weierstrass type, which satisfies all the equations of (8.9). 
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The function R satisfies the following functional equations: 


2g- 
m-q-1 ee 


1 
(8.10) > f ) =-—f(qx), x €R, m,q € Z}, square-free. 
q 
k=0 


mq 
Particular cases of (8.10) are 


m?—1 


(8.102) > fF (: + 
k=0 


) = Feo. xé€R, me Zi g=1), 


m 


g=1 

k 1 

(8.10b) >) f (- Ls ) =—f(qx), xR, q € Z% square-free (m = 1). 

q q 
k=0 


The following characterization of R is in some way analogous with S, in [414]. 
Suppose that f : R — R is continuous, of period 1, odd, and satisfies (8.10) for 
every prime g with f (4) = a Then f = R. 

The proof is similar to that of Theorem 8.2 but more involved. The func- 
tional equations (8.10b) for primes q imply the following restrictions on the com- 
plex Fourier coefficients of f: kcx = c1,kc_~ = c_, if k € N is a square and 
Ck = c-x = 0 otherwise. Then f(x + 1) = f(x) and f(—x) = —f(x) imply 
that the Fourier series of f coincides with R. The conclusion is obtained by Fejér’s 
Theorem. 


Remark. Both systems (8.10a) and (8.10b) have nontrivial real analytic solutions. 


There are generalizations of Riemann’s function: In 1916, G.H. Hardy [340] 


investigated 
[o,@) 


1 
Ag(x):= > ce sin(n’xx) forl<a <3. 


n=1 


8.8 Knopp Functions 


The general Knopp function Ka,p,¢ : R — R is defined by 


ee) 


Kegg) = > a's" d, 


n=0 


where |a| < 1, p € Zi\{1}, and g : R > Ris of period 1. 

Many results on the c.n.d. property in special cases have been obtained by 
K. Knopp (1918) [541, 542] and others. The most general ones are due to 
R. Girgensohn (1994) [319]. He showed that: 


|a|p > 1 and g’ bounded implies that Ka,p,¢ is c.n.d. or Kj, . is bounded, 
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|a|p = 1 and g” bounded implies that Kq,p,¢ is c.n.d. or bee is bounded, 
g'(0) £ Oand g” bounded implies that K1/p,p,¢ is c.n.d. 


Ka,p,g Satisfies the following functions equations (a, p, g as above): 


(8.11) f (x) = af (px + gx), xeéR, 


(8.11a) (=) =afy+e(*), 0<k<p—1,x€ [0,1]. 


We have characterizations according to Results 8.5 and 8.6. 
Let f : R — R be bounded and satisfy (8.11) with a continuous g. Then 


f = Kayp,g- 
Suppose that f : R — R is bounded, of period 1, and satisfies (8.11a) with a 
continuous g. Then f = Ka, p,g- 


Remark. K,,p,¢ is another generalization of Takagi’s T. K1/10,10,¢ is the famous 
van der Waerden function [824], which played a prominent role in the history of 
c.n.d.. 


8.9 Generalization 


Most of the equations studied so far fall under a system of functional equations in a 
single variable of the form 


(8.12) > flm@l=af@)+s@), xelCR, f:1>R, 


v=1 


where h, : J — R is a given rational function having a very simple structure, 
g : I — Risa given function, and a € R is a given real number. Some important 
particular cases are 


(8.13) f(x) = af (bx) + g(x), 
(8.14) f(A) = a1F004 2000, ke {0,1,...,p — 1}, 


125 k 
(8.15) -Sr(S ) = wnrreo. peScNn. 
oar: p 


Equation (8.13) is referred to as an iterative equation. 
These equations were applied in various characterizations of c.n.d.f.s along with 
the following result. 


Theorem 8.4. (Rathore [690]). Let g : R — R be continuous and bounded, |a| < 1, 
and b € R. Then there exists exactly one bounded function f : R — R that satisfies 
the iterative equation (8.13). The function f is continuous and is given by f(x) = 


CO 
> a" g(b"x), x ER. 
n=0 
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Proof. Assume that there is a bounded solution f : R — R of (8.13). Iteration gives 


m—1 


f(x) — a” f (bx) +4 > a" g(b"x) 


n=0 
for every m € Z7_ and x € R. As f is bounded and |a| < 1, necessarily 


(oe) 


f(x) = Do a"g(b"x) 


n=O 


for every x € R. The assumptions on g and a imply that the function given by the 
series is continuous, and it is easily checked that it satisfies (8.13). 


Result 8.5. (Girgensohn [317]). For fixed p € {2,3,4,...}, let gx : [0,1] > R be 
continuous, |ax| < 1 for0 <k < p —1, and assume 
ak-1 


——-8p-1(1) + gr-101) = 
1— ap-1 


ak 


go) + gx), 1<k<p-l. 
l—ao 


Then there exists exactly one bounded f : [0,1] — R that satisfies the system 


x+k 
(8.14) r( P ) = 0.700) + auto x €[0,1],0<k<p-l. 


The function f is continuous and is given (in terms of the p-adic expansion of x) by 


(Ze) E(Os)(E5) 


n 
are n=1 \k=1 k=1 P 


Result 8.6. (Kairies [414]). Let u € €! be completely multiplicative; i.e., u(1) = 1, 
u(mn) = u(m)u(n), for every m,n € Z%.. Then there is exactly one continuous 
f : R= R that satisfies 


1 


- 
(8.15) 2 f (- i 2) =u(p)f(x), x €R, 
Po P 


for every prime p, is of period 1, is odd, and is normalized by b, = | (the first 
oe} 
Fourier sine coefficient of f). The function f is given by f(x) = > u(k) sin2zkx. 


Most of the well-known examples of c.n.d.f.s are obtained as solutions of func- 
tional equations, and the nondifferentiability is deduced from the elementary func- 
tional equations, without making use of the explicit representations. Further, there 
are several c.n.d.f.s, starting from Weierstrass’s, and plenty of functional equations 
connected with them. 


9 


Characterization of Groups, Loops, and Closure 
Conditions 


Groupoids, quasigroups, loops, semigroups, and groups are considered in this chap- 
ter. Closure conditions, isotopy, and groups are characterized by various identities. 
Functional equations arising out of Bol, Moufang, and extra loops are considered. 
Mediality, the left inverse property, Steiner loops, and generalized bisymmetry are 
treated. 

This chapter is devoted to algebraic identities (and their generalizations leading 
to the involvement of functional equations) connected to groups and well-known 
special loops such as Bol, Moufang, left inverse property (1.i1.p.), and weak inverse 
property (w.i.p.). 

An identity in a binary system (such as transitivity, bisymmetry, distributivity, 
Bol, Moufang, etc.) induces a generalized identity—a functional equation—in a 
class of quasigroups. We treat several of them in this chapter. First we give some 
notation and definitions. 


9.1 Notation and Definitions 


Groupoid A (nonempty) set G endowed with a binary operation (-) is called a 
groupoid; that is, foralla,be G,a-beG. 

Left (right) quasigroup A groupoid G in which ax = b (yc = d) has a unique 
solution for x (for y) for all a,b (c,d) in G is called a left quasigroup (right 
quasigroup). 

Quasigroup A groupoid G in which the equations ax = b and yc = d are solvable 
uniquely for x and y, given a, b,c,d in G, is called a quasigroup; that is, for 
ab = c inG, given any pair, the third element is uniquely determined. In other 
words, a groupoid that is both a left and a right quasigroup is a quasigroup. 

Loop A quasigroup with an identity is called a loop. 

Semigroup An associative groupoid is a semigroup. 

Group An associative quasigroup (or loop) is a group. 

Left cancellative (1.c.) A groupoid G is said to be left cancellative if ax = ay in G 
implies that x = y. Right cancellative (1.c.) is defined similarly. 
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9.2 Closure Conditions 


R-groupoid A groupoid G with the property that, for all x;, yj e G @ = 1,2, 3,4), 
the equations xj y2 = x2y1, X14 = X23, X4y1 = X3y2 imply x34 = x4y3 is 
called an R-groupoid. This closure condition is called the Reidemeister condi- 
tion. Geometrically, it means the following. 


If there are third lines through (x; y2,x2y1), (*1y4,x2y3), and (x3y2, x4y1), 
there is a third line through (x3 y4, x4y3); that is, the figure closes at (x3 ya, x4y3). 

T-groupoid A groupoid G is said to be a T-groupoid if, for all xj, yj € G Gi = 
1,2,3), the equations X1¥2 = X2¥1, X1¥3 = X31 imply X2V3 = X32. This 
closure condition is known as the Thomsen condition. The following figure is 
self-explanatory. 


H-condition (Hexagonal condition) For x1, «2, x3, y1, y2, y3 € G, ifx1y2 = x21, X13 = 
x2y2 = x3y1 implies x2y3 = x3y2, G is said to satisfy the closure condition 
known as the hexagonal condition. 


9.3. Groups 3/3 


The H-condition implies power association, that is, 


x ayn -_ ay" Ly 


n 


for allx, y € Gandallm,ne€ Z. 

Isotopy Let G(-) and H (c) be two groupoids. If there exist three mappings a, f, y : 
G — H that are bijective (one-to-one and onto) such that y (x - y) = ax o By 
for all x, y € G, we say that G(-) is isotopic to H (0) and the triplet (a, f, y ) 
is said to be an isotope. When a = f£ = y, we get the usual isomorphism 
(homomorphism). An isotopism (a, £, Ig) of G(-) onto G(o) is called a princi- 
pal isotopism, where Ig is the identity map of G onto G. 


Fact 1. Every isotope of a quasigroup is a quasigroup [135, p. 56]. 

Isogroup Let G(-) be a groupoid. We say that G(-) is an isogroup (Abelian), pro- 
vided there is a group (Abelian) G(o) that is a (principal) isotope of G(-) such 
that - and o are connected by one of the relations 


(9.1) xoy=a(x)-y or xoy=x-a(y) or xoy=a(x-a(y)), 


where «a is the inverse operator of G(o). 

Left and right translation Mappings Ly, Rg : G > G defined by Lyx = ax, Rgx = 
xa, for all x € G (groupoid), are called left (right) translations or multiplication 
by a. 


Fact 2. Suppose a, 6, P : G > G such that P = af (af)(x) = a(f(x)). If P 
is a permutation, then a is onto and f is 1-1. 


Fact 3. (Aczél [11]). (R) A quasigroup G(-) is isotopic to a group if and only if 
the Reidemeister condition holds in G(-). 

Fact 4. [11] (T) A quasigroup G(-) is isotopic to an Abelian group if and only if 
the Thomsen condition holds in G(-). 


9.3 Groups 


First we consider identities that characterize a group. A variety of groups is a class 
of groups that can be defined by means of a number of identical relations or laws, as 
they are sometimes called. Alternatively, it is a class of groups that is closed under 
the formation of direct products, homomorphic images, and subgroups. 

One of the problems of the varieties is to define some class of varieties by single 
identity. This problem has been solved for groups, Abelian groups, loops with the 
inverse property, etc. It is known for groups that certain varieties of groups may be 
defined by a single law in terms of multiplication and inversion; of right division. 
It is known that each variety of groups that can be defined by a finite system of laws 
as a subvariety of the variety of groups can be defined by a single law as a subvariety 
of the variety of groups. In particular, the variety of all groups can be obtained from 
that of groupoids by a single law. 
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Here are some of the ways groups can be defined: 


I In terms of multiplication (-) and inversion (—1). 
(-) is associative, there is an e (identity) such that xe = ex = x, and there is x~ 
(inverse to x) such that xx~! = x-!x =e. 

II (-) is associative; equations ax = b, yc = d are solvable for x, y. 

Il By four laws: xx~! = yy~! = e; xe7! = x = e(ex—!)"!; [x(ey"!)“!] - 
Ce yaar) 1 ee ee 

IV [{y-[z- (xv)]}7! - (vz)]7! - v7! = x (defined as a variety of groups by a single 
law). 

V (Wimmer and Ziebur [832]). Let G be a set with binary operation (—) (called 
subtraction or difference) such that, for all x, y, zin G, (i) x—y € G; (ii) there is 
an element 0 € G such that x—y = Oif and only if x = y; (ili) @—z)—(y—z) = 
x — y hold. 

Define + (called addition) in G such that 


1 


xty=x-—(0-y). 


Then G(+) is a group. 


9.4 Abelian Groups 


The commutative law may be written as (iv) x — (x — y) = y ((iv) is obviously 
equivalent to x + y = y + x in any group). 

A necessary and sufficient condition for G to be an Abelian group is that it satisfy 
(i), (iii), and (iv) or (i), Gi), and (v) (W —z)-(*-y)=y—z. 


Characterization of Groups Satisfying a Single Identity 


An identity in a binary system induces a generalized identity (functional equation) 
in a class of quasigroups. For example, the well-known identities of transitivity, 
distributivity, Bol, Moufang, I.c., and extra loop 


(9.2) xy -Zy = Xz, (transitivity) 
(9.2a) XY +-XZ= yZ, 

(9.3) xy + Zy = ZX, 

(9.3a) XY XZ =Zy, 

(9.4) X*yZ=XY-XxXZ, (distributivity) 
(9.5) y- (+ yx) =(y-zy)-x, (Bol (left)) 
(9.6) (xy-x)z=x-(y-xz), (Moufang) 
(9.7) (x-xy)-z=x-(x- yz), (Le. identity) 


(9.8) yx + zx = (y-xzZ)-x, (extra identity) 
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give rise to the generalized associativity, bisymmetry, transitivity, distributivity, Bol, 
Moufang, l.c., and extra loop functional equations on quasigroups, and these equa- 
tions on quasigroups will be investigated later. We study and solve the generalized 
functional equations on a more generalized algebraic structure than quasigroups, 
namely the GD-groupoids. 

First we treat group identities. 


Theorem 9.1. (See Aczél [12] and Kannappan [433, 435].) Let G(-) be a groupoid 
satisfying the transitivity identity (9.2) with left cancellativity (I.c.). Then G(-) is an 
isogroup. 


Proof. Given xz - yz = xy + (Lc.), suppose a- a = e for somea € G. Setx =y = 
z=ain(9.2)tohavee-e=e. 

Putx = y =e in (9.2) and use Lc. to get e- ze = ez; that is, ze = z forall z € G. 

Set z = y in (9.2) to obtain xy - yy = xy = xy -e so that Lc. yields yy = e for 
ally €G. 

Put y =x, z =e in (9.2) to have e- ex = x forall x. Define o on G by 


(9.1) xoy=x-ey=x-Bly), Bly) =ey. 


f(ex) = x and Lc. implies f£ is 1-1. Hence G(-) is isotopic to G(o) (UZ, £, J) is 
the isotopism). We will show that G(o) is a group; that is, (0) is associative and 
aox=b, yoc =d are solvable. 

y =x in (9.2) gives 


(9.9) e-zx =xz forallx,z eG. 
Using transitivity, (9.9), and (9.1), we have 

(Koy)oz=(x-ey)-ez 

= elez- (x - ey)] by (9.9) 

e-[(ez- y)-{(x- ey)- y}] transitivity 
=e-[f{e-(y-ez)}-{@ - ey) - (e- ey)}] 
=e-[{e-(y-ez)}-x] 
=x-(e-(y-ez)) 
=xo(yoz); 


that is, o is associative. 


Solvability 
ao(ea-eb) =a: {e(ea-eb)} 
=a-(eb-ea) by (9.9) 
= (e-ea)- (eb- ea) 
=e-eb (transitivity) 
=b 


shows that a o x = b is solvable. |.c. implies the uniqueness of the solution. 
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baoa = ba-ea = be = b shows that y oa = bis solvable. As before, l.c. 
implies the uniqueness of the solution. Thus G(o) is a group and G(-) is an isogroup. 
Note that e is the identity for (0). 

This proves the theorem. 


Remark 9.2. Assume (9.2a) and r.c. By defining a * b = b - a, (9.2a) goes over to 
(9.2) with l.c. Hence G(-) satisfying (9.2a) + rc. is an isogroup. 


We now show that G(-) satisfies the R-condition, thereby proving that G(-) is 
isotopic to a group. 

Suppose x;y2 = x2y1, X14 = xX2Y3, X4y1 = X32. Then yoyq = xp y2- x14 
(by (9.2a)) = x2y1 - X33 = Y1Y3, 2X3 = X32 -X3X3 = X4y1 -X4xX4 = y1x4. Now 
X3Y4 = yox3- y2v4 = yixX4- V1 Y3 = X4y3. Thus the R-condition holds. By Fact 3, 
(note that G(-) is isotopic to a group, so a quasigroup G(-) is isotopic to a group. 


Now we consider identity for an Abelian group. 


Theorem 9.3. [438, 433]. Suppose (9.3) holds with l.c. in G(-). Then G(-) is a 
principal iso-Abelian group. 


Proof. We prove this by using the T-condition. It is easy to show that, as in Theo- 
rem 9.1, 


(9.10) xx =e, ex=x, xy=yx-e, forallx,yeEG. 
Using (9.3) and (9.10), we have 

(9.11) xy-uv = (yx-e)-(vu-e) =ovu- yx. 
To prove the 7 -condition, let 

(9.12) X1y2=x2y, and x, y3=Xx3y1. 

Now (9.3), (9.11), and (9.12) yield 


(9.11) (9.12) 
Y2Y3 = Y3X1* Y2X1 = X1Y2° X13 = X2V1 + X3ZY1 = X3X2 
and 
(9.3) (9.11) (9.3) 
X3Y2 = Y2Y3° XZ YZ = X3X2°XZYZ = Y3XZ°X2XZ = X2Y3.- 
Hence the T-condition holds and G(-) is isotopic to an Abelian group. Isotopism is 
given by 
(9.1) xoy=xe-y, 
yox=ye-x 
= (x - xe): (ye- xe) 
= (x - xe): (xy) 


(9.3) 
= xe-y=xoy. 


This proves the theorem. 
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Theorem 9.4. (Hecke [362], Kannappan [435]). The variety of groups is the variety 
of all groupoids G(-) defined by the single law 


(9.13) x-[{(xx-y)-+z}-{Qx-x)-z}]}=y forallx,y,z€G. 


G(-) satisfying (9.13) is an isogroup. 


Proof. We will show that G(-) is a quasigroup satisfying the transitivity (9.2). 
Using left and right multiplication, (9.13) can be written as 


(9.14) Ly Rex-x)z* Rz Lux = Id, 


for all x, y,z € G, where Id is the identity map. By Fact 2, we see that L;, is 1-1 
and L, is onto, so that L,, is a permutation. By shifting L,, to the right side, we 
conclude that R; is 1-1. 

Replace x by xx in (9.14) to get 


(9.14a) Lay R((ex-xx)-xx)z* Re + Lxx-xx = Id. 


Since Ly, is a permutation, we see that R((yx-xx).xx).z Is onto. But R, is 1-1. Hence 
R((xx-xx)-xx)-z 18 a permutation. Then, from (9.14a), we obtain that R; is a permuta- 
tion. Now (9.14) shows that Ly is a permutation. Hence G(-) is a quasigroup. 

Set y = x in (9.13) to have 


x + [Cex + x)z}- {Ocx -x)z}] = x. 

Using z as a variable, we get x -uu = x = x - vv forallu,v € G, yielding 
u-u =constant =e (say) forallu eG, 

and xe = x. Then (9.13) becomes 


x-[(ey-z)-(ex-z)]=y, forallx,y,z €G, 


=x-(ey-ex) 
(take z = e) and also e- ex = x (y =e). Thus, using l.c., we get 
(ey -z)- (ex-z) =ey-ex, 


which is yz -xz = yx (replace x, y by ex, ey). This is transitivity (9.2). By Theo- 
rem 9.1, G(-) is an isogroup. 


Theorem 9.4a. [362, 435]. The variety of Abelian groups is the variety of all 
groupoids defined by the identity 


(9.13a) xX-(yz-yx)=z forallx,y,z€EG. 


A groupoid G(-) satisfying (9.13a) is an iso-Abelian group. 
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Proof. The proof is similar to that of Theorem 9.4. We will show that G(-) is a 
quasigroup satisfying (9.3a). 
Using right and left multiplication, (9.13a) can be written as 


(9.14a) Ly,RyxLy =Id_ forx,y eG. 


By Fact 2, Ly is 1-1 and L, is onto, so that L, is a permutation and then Ry, is also 
a permutation. Using L, as a permutation, we can get that yx is any t € G so that R; 
is a permutation and G(-) is a quasigroup. Then (9.13a) yields (using y as variable) 


YX + yX = UX + UX; 


that is, ¢-¢ = e for anyt € G. 
Putting y = x = z in (9.13a), we have xe = x, and y = x in (9.13a) gives 
x +-xz =z. Then (9.13a) becomes 


YZ+ yx = XZ, 


which is (9.3a). Thus G(-) is isotopic to an Abelian group. 


Result 9.5. (Legendre [604]). A groupoid G(-) satisfying xz - yz = xy (9.2) tran- 
sitivity) and x - (yy + y) = y isa group. Further, if it satisfies x -xy = y, then it is 
an Abelian group (see Theorems 9.1 and 9.3). 

Result 9.6. (Stamate [765]). A groupoid G in which to each element x there corre- 
sponds an element x' is a group if and only ifab-c = ad -u implies b =d-uc' (a 
single axiom in terms of two operations, - and unary). 

Result 9.7. Let G(-) be a groupoid in which to each element x there corresponds an 
element x' in G. Suppose the following postulate holds: If aa’. b' = rs’ - t’ implies 
b = tr’-s, then G(-) is an Abelian group (a single axiom in terms of - (multiplication) 
and unary operation (inversion)). 


Theorem 9.8. (Morgado [640]). Let G(-) be a groupoid satisfying that the condition 
(9.15) [a- (bb-b)]- (cce-c) = [a- (dd-d)]- (uu -u) 

implies b = d - eu. Then, if one defines 

(9.1) aob=a-(bb-b) fora,beG, 

then G(o) is a group isotopic to G(-). 


Proof. Briefly outline the steps leading to the transitivity equation (9.2) using (9.15) 
a few times. Taking d = b, u = c in (9.15), we get b = b- cc and in particular 
bb = bb-cc = bb- bb and then bb = constant = e and b = be and e- eb = b. Now, 
for transitivity, 


ab = [e- {(ab-ab) - ab}] - (ee - e) 
= [e- (aa-a)]-[(eb- eb) - eb], 
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which by (9.15) yields 


a=ab-eb 
= fe eh- ab) apy bh<d) 
= [e - {(ac- ac) -ac}]- (cc +c). 


Again by (9.15), we get ab = ac - bc, which is (9.2). Now use Theorem 9.1. This 
proves the result. 


A groupoid G(-) is said to satisfy a generalized associative law (g.a.l.) provided 
there exists a pair a, b € G such that 


(9.16) [(@ - by)-a]-z=x-[b- {(ya-z)}] 


for allx, y,z € G. 


Result 9.9. (Fotedar [286]). [fa groupoid G(-) satisfies (9.16), then G(o) is a semi- 
group, where o is defined by 


(9.17) xoy=xa-by forx,yéG; 


that is, G(-) is isotopic to a semigroup. 
Further, if G(-) is a quasigroup satisfying (9.16), then G(-) is isotopic to a group 
G(o0), where o is given by (9.17). 


A groupoid G(-) satisfies the law (known in [264] as generalized associative) if 
there exist permutations P;, Q; (i = 1, 2,3,4,5) from G to G such that 


(9.18) Ps5(P4(Pix - (Poy - P3z)) = Q5(Q3(Qix - Q2y)) - Qaz) 
holds for all x, y,z € G. 
Result 9.10. (Evans [264]). 


(a) A finite groupoid with a unit and satisfying (9.18), where P; are permutations, is 
associative (that is, a semigroup). 

(b) A groupoid with a unit, division on one side, and satisfying (9.18), where P; and 
Q; are permutations, is a group. 


Abelian Groups 
Suppose G(-) is a groupoid and 


(9.19) x-le-{fy-@-oH=y 


holds for x, y, z € G. We will prove that G(-) is a quasigroup and (9.3a) holds. 
Replace z by z - (y - x) in (9.19) and use (9.19) to have 


x-[z-(y-x)-{y-@-@-Ox)))] =y; 
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(9.20) x-[{z-Qy-x)}-z]=y forx,y,zEG, 
which can be written as 

(9.20a) L,R,L,R, = Id. 


So, R, is 1-1, (-) is rc., and L, is onto. 
Suppose ya = yb. Then (9.20) yields 


a-[{z-(ya)}-z]=y =b-[{z- (yb)} -z]. 


r.c. implies a = D; that is, (-) is lc. Hence Ly is a permutation and (9.20a) shows 
that R, is a permutation. 
Replace y by x - z in (9.19), and use I.c. to obtain 


z-[@-z)-@-z)] =z; 


that is, 
z:-(t-t)=z forz,teG 


(use solvability) and tt = constant = e and ze = z. Nowz = x in (9.19) yields 
x-(-y)=y. 
With y replaced by xy, (9.19) gives 
z-Qry-xz)=y 
or 
z-y=z-[z-@y-xz)] =xy-xz, 


which is (9.3a). Thus G(-) is isotopic to an Abelian group. Thus we have proved the 
following theorem. 


Theorem 9.11. A groupoid G(-) satisfying (9.19) is an iso-Abelian group. 

Theorem 9.11a. A groupoid G(-) satisfying 

(9.19a) xX-(xz-yz)=y, forallx,y,z€G, 

is an iso-Abelian group. 

Proof. We will show that (9.19) holds. Equation (9.19a) can be written as 
L,Ly,Rz, = Id, 


so that Ly is onto or at = b is solvable for t. Set y = x in (9.19a) and use xz = t 
(any t € G) to get 
xX -+ (tt) =x. 
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Now z = ff in (9.19a) yields 
x-(Q-y=y. 
Finally, replace z by xz in (9.19a) to have 


x-[z-Q-xzl=y, 


which is (9.19). This proves the theorem. 


Theorem 9.11b. A groupoid in which 

(9.19b) Ix-(z-y)]-@z)=y, forx,y,z€G, 

holds is an iso-Abelian group. 

Proof. We will prove that (9.19) holds. Equation (9.19b) can be written as 
Ry-L,L, = Id, 


which implies that (-) is l.c. since L, is 1-1. Suppose a - y = b- y. Then use (9.19b) 
to get 
[x - (ay)]- (wa) = y =[x- @y)]- @d), 


which on using lL.c. twice yields a = b; that is, (-) is rc. 
Replace y by y - xz in (9.19b) and use r.c. to have 


[x-{z-(y-xz)}]-@-z)=y- (xz), 


which is (9.19); that is, G(-) is isotopic to an Abelian group. This completes the 
proof. 


Theorem 9.11c. (Hecke [362]). In a groupoid G(-), suppose 
(9.19c) x-(@y-zx)=y, forx,y,z€G, 


holds. Then G(-) is an iso-Abelian group. 


Proof. (See also Theorem 9.4a.) Suppose zy; = zy2. Then (9.19c) shows that 
y1 = yg; that is, (-) is l.c. Replace y by xy in (9.19c) and use L.c. to obtain 


(z-xy)-@x)=y, 


which is (9.19b). By Theorem 9.11b, the result follows. 
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In what follows, G will denote either the set of all real numbers R or the unit circle 
T (complex numbers z with |z| = 1). 
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Theorem 9.12. (Kannappan [438]). Let F : G x G > G be an onto, continuous 
map satisfying the functional equation 


(9.2a’) F(F(x, y), F(x, z))=FO,z) forallx,y,z€G. 


Then the set of idempotents S of G is either G or consists of only one element and 


(9.21) F(x,y)=y 

or 

(9.21a) Fy) =f '(Ff0)- f(x), whenG =R, 
or 

(9.21b) Fay) =f'FO)FO)"'), whenG =T, 


accordingly, where f : G — G is some homeomorphism. 


Proof. For x, y € G, define F(x, y) := xy, and for subsets U, V of G, UV := {uv: 
u € U, v € V}. Then (9.2a’) can be rewritten as 


(9.2a) xy-xz=yz forx,y,zeEG. 


Let S = {x € G: x =xx = x7} be the set of idempotents of G. Evidently, from 
(9.2a) it follows that x? - x? = x”, so that x? € S for every x € G. As every element 
in S is the square of itself, S is precisely the squares of elements in G. 

The continuity of (-) implies that S is closed. Further, it is easy to see that S is 
connected. Hence S is a closed interval, which may be a single point. The cardinality 
of Sis either > 1 or= 1. 

First, let us suppose that |S| > 1. Let e € E. Then we have 
(9.22) i: =e forall g € G; thatis, 


Ge=e. 


Indeed, suppose t € Ge M S. Then, by (9.2a) we have 


(9.23) t=tt =ge-ge=ee=e. 


First, let us suppose that e is an interior point of S. If x (4 e) € Ge, then since 
Ge is connected, Ge contains the interval from e to x. But then Ge contains more 
idempotents other than e. But this by (9.23) cannot be, so in this case (9.22) holds; 
that is, Ge =e. 

Now, let us assume that e is an end point of S. Let xe (4 e) € Ge for some 
x € G. Then there are two open neighbourhoods U of xe and V of e such that 
UNV = @. By the continuity of (-), there are open neighbourhoods W of x and N 
of e such that WN C U. Since e € VON, there isat € VMN such that ¢ is an 
interior point of S$. Since (9.22) holds for interior points of S, we have xt = t and 
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thatxt e WN CU, implying t ¢ UN V, acontradiction. Thus (9.22) holds true in 
this case also. 

Let x, y € G. Then, by (9.2a) and (9.22), we have yx - yy = xy and yx-yy = yy 
so that, for every x,y € G, xy = yy € S. The hypothesis GG = G will yield 
G C S; that is, G = S. Thus, when |S| > 1, G = S. Then, by (9.22), we have 


xy=y forallx,y eG, 


proving (9.21). 
Let us suppose that |S| = 1. Let e be the unique idempotent. Then we have 


(9.24) xx =e forallx €G. 


Now we will show that Ga = G for any a € G; that is, there is an x € G such that 
xa = b holds for any b € G. 
By hypothesis, there are y, z, u,v € G such that b = yz anda = uv. Now, using 
(9.2a) and (9.24), we have 
b=yz=7zy-72Z=Zy-ov 
by (9.2a) = (vu -zy)- (vu- vv) 

= (vu -zy)- uv 

= (vu -zy)-a; 
that is, xa = bis solvable for x. 


Hence, by Kannappan [433], G is an isogroup; that is, G(o) is isotopic to a group, 
where o is given by 


(9.1) xXoy=xe-y or (x-ye)-e. 
Further, G(o) is a topological group. So, when G = S, there is an isomorphism 
f : S(02) > SG) 


such that 
f@oy)=fa)+ fO) 
and 


f(ye) = —f 0). 
Since, by (9.1), xy = xe - y, it follows that 


f(xy) = f(xe) + FO) = FO) — FO), 


so that (9.21a) holds. 
Similarly, (9.21b) can be proved. This completes the proof of this theorem. 
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Result 9.13. [438]. If F : G x G —> G is an onto, continuous function, satisfying 
the functional equation (9.3') 


F(F(x, y), F(x,z)) =F, y) forx,y,7€G, 


then 
F(x,y)=f '¢@)-f0)) whenG=R 
or 


Fi, y)=f 'F@)-fO)') whenG=T 


where f : G — Gis some homeomorphism. 
Identity (9.3) can be transformed into identity (9.3a) by defining a binary opera- 
tion (*) in G such that 
xX*ky = yx. 


As a consequence, we have the following result. 

Result 9.13a. [438]. Let F : G x G > G be an onto, continuous function satisfying 
(9.3’) F(F(x,y), F(z, y)) = F(z, x). 

Then F has the form 


F(x,y)= f '(f(y) — f@)) whenG eR, 
Fa,y)=f 'fO)F@)') whenG =T. 


For the functional equation 
(9.2’) F(F(x,y), F@, y)) = FQ, z) 


arising out of the transitivity equation (9.2), see [12, 148]. 


9.6 Functional Equations Arising Out of Bol, Moufang, and 
Extra Equations 


9.6.1 Bol Equation 

From the left Bol identity 

(9.5) x(y@z)) = @(yx))z 

on a quasigroup, we obtain a generalized Bol identity on a class of quasigroups, 


(9.5a) Py (x, P2(y, P3(x, z))) = Pa(Ps(x, Poy, x)), z), 
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where the P;’s are quasigroup operations on a set H. The general solution of this 
generalized Bol functional equation is obtained by reducing it to another functional 
equation, 


(9.25) PQ, y+ S(x,z)) = PO, y+ax)) +z, 


where P and S are quasigroup operations on H and a(x) = S(x, 0). If the operations 
in the last functional equation are considered on real numbers (or groups), then the 
solution of this equation is obtained. 

One of the most important identities considered in the theory of quasigroups is 
the Bol identity. A loop H (-) is called a left Bol loop [114] if the identity 


(9.5) X(y(xz)) = (x(yx))z 


holds for every x, y, z € H. This identity is called the left Bol identity. The right Bol 
identity is defined analogously: 


(9.5’) ((zx)y)x = z((ry)x). 


If a loop is both a right and a left Bol, then it is a Moufang loop; i.e., one of the 
following Moufang identities is satisfied: 


(9.6) x(y(xz)) = (@y)x)z, 
(9.6a) XY + ZX = (K+ yz)x, 
(9.6b) xy zx =x-(yz-x), 
(9.6c) ((zx)y)x = z(x(yx)). 


It is easily seen that (9.6) is a particular case of (9.5’) if H (-) satisfies the elasticity 
law (xy)x = x (yx), and then (9.5) implies (9.6). On the other hand, the left Moufang 
identity (9.6) does not imply (9.5), see, for example, [135]. 

Each identity in a universal algebra defines a generalized identity that is 
obtained from the given identity by replacing operations of the same arity (num- 
ber of variables) by different operations of the same arity. Generalized asso- 
ciativity P|(P2(x, y),z) = P3(x, Paty, z)), generalized bisymmetry P;(P2(x, y), 
P3(u, v)) = Pa(Ps(x, u), Po(y, v)), and generalized distributivity P| (x, P2(y, z)) = 
P3(Pa(x, y), P5(x, z)) are examples of such generalized identities. These identities 
will be considered as functional equations later (for references, see [12]). 

To the left Bol identity there corresponds the generalized Bol (left identity) 
(9.5a). The corresponding identity for the right Bol identity is 


(9.5b) 01(Q2(O3(z, x), y), xX) = Oa(z, 05(Q6(x, y),*)). 


Of course, all operations P;, Q; (i = 1,2,...,6) are defined on the same set H. 

We shall consider equation (9.5a) on quasigroups; that is, we assume that all 
P; and Q; are quasigroups (quasigroup operations). In the following sections, we 
reduce equation (9.5a) to a simpler one containing two quasigroups and one loop, 
and we give a full solution of this equation under some suppositions. 
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We shall use the following notation. Let P be a binary operation defined on the 
set H. We denote the translations of P by 


(9.26) Lp(a)x = P(a,x), Rp(a)(x)= P(x, a). 


If P is one of the operations P; (( = 1,2,...,6) from (9.5a), then we shall write 
L;(a) instead of L p, (a) and, moreover, if a is a fixed element k of Q, then we shall 
write L; instead of L;(k). Similar notation is used for right translations. 

Let 0 be a fixed element of the set H. We denote L p(0) = L’, Rp(O) = R’, and 


(9.26a) xty= PR ey), 


Then H (+) is a loop (Bruck [135]) with the neutral element P(0, 0) = Op. 
P 


Let all the operations in (9.5a) be quasigroup operations. Then L;’s and R;’s are 
permutations of H. If x = k in (9.5a), then from (9.27) we have 


L Po(y, L3z) = Pa(LsRoy, Z); 


that is, 
P4(y,Z) = Li P2(Rg'L5"y, L32). 


Using this and (9.5a), we have 

Pi (x, Pay, Ps(@, 2))) = Li Po(Rg Ls ' Ps(x, Po(y, x), L3z))- 
With z = k, it becomes 

Pi(x, Po(y, R3x)) = LiRjRe'L5'Ps(x, Poly. x)), 

where R4, = R2(L3k). From these two equations, we obtain 
(9.27) Ly" Paw, Paly, Psa, z))) = Po(RaLy' Pie, Pa(y, R3x)), Lz). 
Let 
(9.28) Ci@,y) = Ly'Pi(x, y). 
Now (9.27) and (9.28) yield 


Ci (x, Po(y, P3(x,z))) = Po(R5'Ci(x, Po(y, R3x)), L3z); 


that is, 
es) Ci(x, Po(R5 'y, R3R3' P3(x, z))) 

= P)(Ry'Ci(x, Po(Ry'y, R3x)), L32). 
Let 


(9.30) Co(x, y) = Po(Ry'x, Ray), C3(x, y) = Ry | Ps(x, Lz R3y). 
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With the help of (9.30), (9.29) can be rewritten as 
Ci(x, Ca(y, C3(x, Ry 'L3z))) = Co(Ci(x, Coy, x)), Ry £32); 
that is, 
(9.31) C1 (x, Co(y, C3(x, z))) = Ca(Ci (x, C2(y, x)), z). 


From (9.28) and (9.30), it follows that C;, C2, and C3 are quasigroup operations on 
HA since L1, L3, R3, and R, are permutations of H. 

As every quasigroup is isotopic to a loop [135], we can assume that C2 is isotopic 
to a loop; that is, C2 satisfies 


(9.32) Co(x,y)=Rx+L’y, 
where R’ and L’ are as in (9.26a). Then H (+) is a loop. By (9.32), (9.31) becomes 


Ci(x, R’y + L'C3(x, z)) = R'Ci(x, R'y + L'x) + L'z; 


that is, 

(9.33) Cie, y + L'Ca(x,z)) = R'Ci@, y + L’x) + L’z. 
Now define 

(9.34) PE) =CGD) SHS Ce"). 


Evidently, P and S are quasigroup operations on H. Using (9.34), we obtain from 
(9.33) 
P(x, y + S(x, L'z)) = R’P(x, y+ L'x)+ Lz; 


that is, 
(9.35) P(x, y + S(x,z)) + R'P(x, y+ L’x) +z. 
Putting z = 0 in (9.35), we have 

P(x, y + S(x,0)) = RPO, y+L'x), 
and thus we get 
(9.36) P(x, y+ S(x,z)) = P(x, y + a(x)) +z, 


where a(x) = S(x, 0). 
Hence we obtain from (9.29), (9.30), and (9.34) 


Pi(x,y) = LiCi(x, y) = LiP(@,y), 

Po(x, y) = C2(Rbx, Ry! y) = R’R)x + L'R3'y, 

Ps(x, y) = R3C3(@, Ry Lay) = R3L’"'S@, L'R'Lsy), 

P4(x, y) = Ly Po(Rg | L5'x, L3y) = Li(R/R,L5 x + L’R;'L3y), 
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where P and S satisfy (9.36). That is, with Lj = 6, RR, = 4, LR," =i, 13 = 
y, and a = 0, we can rewrite them as 


Pi(x, y) = P(x, y), 
Po(x, y) = Ax + wy, 
P3(x,y) = w'S(x, nyy), 
Pa(x, y) = P(R'Ox + yy), 


(9.37) 


where P and S satisfy (9.36) and ¢, 2, “, w, R’, and @ are permutations on H. 
From (9.37), we obtain 


P(x, Poly, R3x)) = PP (x, Ay + UR3x) 
= fbP(x, Ay + L'x) 
= 0 P5(x, Poly, x)), 
and thus 
(9.38) Ps(x, Po(y,x)) = 07! P(x, Ay + L'x). 
Thus we have proved the following theorem. 


Theorem 9.14. (Belousov and Kannappan [115]). Let H be an arbitrary set. 
Let P; G = 1,...,6) be quasigroup operations satisfying (9.5a). Then all the 
solutions of the functional equation (9.5a) are given by (9.37) and (9.38), where 
od, 4, u, w, R’, a, 0, and L' are arbitrary permutations of H, and the loop oper- 
ation + and the quasigroup operations P and S satisfy (9.36). Conversely, the P;’s 
@ = 1,2,...,6) given by (9.37) and (9.38), where P and S satisfy (9.36), satisfy 
the generalized Bol equation (9.5a). 


By a straightforward computation, it is easy to verify the converse part. 


Remark (i). The solutions of the right Bol functional equation can be obtained from 
(9.5a) by replacing all the Q;’s in (9.5b) by the P;’s, where 


Pi (x,y) = Qily, x). 
Remark (ii). The generalized Moufang functional equation 


Pix, Poly, P3(x, z))) = P4(Ps(Po(x, y),*), Z) 


can also be reduced to (9.36). By the same computation, we obtain (9.27), from 
which we get (9.31) and finally (9.36). All the solutions are similar to (9.37). The 
only difference is (9.38), where instead of (9.33) we get 


P5(Po(x, y), x) =07' P(x, Ay + L’x). 


We have seen that the solution of the Bol functional equation (9.5a) is reduced 
to that of (9.36). Let us now consider equation (9.5a) on the set of real numbers R, 
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and let us suppose that H (+) is the additive group of real numbers. So, we have to 
consider (9.36) on R. 
Letting S(x, z) = ¢ in (9.36), we get, using S as a quasigroup operation, 


PQ, yt =P, yt+a(x))+5 1,1, x, y,t ER, 
where a(x) = S(x, 0). Thus, we obtain 
(9.39) Ax(y +t) = ux(y) + x(t) forall y,t € R, 
where 
Ax(u) = P(x,u),  Ux(u) = P(Xx,uta(x)),  vy(u) = S71 (x, u). 


Equation (9.39) is the well-known Pexider equation. Hence there exists an additive 
function A, on R satisfying 


Ax(u + 0) = Axy(u) + Ax(v) 
for all u,v € R, such that 
Ax(u) =c(x)+ Ax(u), 


(9.39a) Mx (u) = b(x) + Ayu), 
vx(u) = d(x) + Ax(u), 
where b(x), c(x), and d(x) are constants depending on x with c(x) = b(x) + d(x). 


With the notation 
F(x,u) = Ax(u), 


we obtain 
(9.40) P(x,u) =c(x)+ F(x, un), 
(9.40a) S7!,u) = d(x) + F(x,u), 


where F is additive in the second variable for each fixed x. From (9.40), we see that 
F is a right quasigroup; that is, F (a,x) = b has a unique solution for all a, b. If in 
(9.40a) we put S~!(x, w) = w, then we have 
d(x)+ FQ,u)=v, 
SQ, w) =u. 


Thus, 
(9.41) S(x, d(x) + F(x, u)) =u. 


Since S(x,0) = a(x), we have S~!(x,a(x)) = 0. Thus, from (9.40a) with u = 
a(x), we get d(x) = F(x, —a(x)) using F additive in the second variable. Hence 
(9.41) becomes 

S(x, F(x,u—a(x))) =u; 
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that is, 
S@, FQ, y))=yt+a(x), 


from which follows, using F as a right quasigroup, 
(9.42) S(, y) =a(x)+ F7'(@,y). 
Therefore, we have proved the following theorem. 


Theorem 9.14a. [115]. Let H (+) be the additive group of real numbers. Then the 
general solution of (9.36) is given by (9.40) and (9.42), where F is an arbitrary right 
quasigroup that is additive in the second variable, and c(x) and a(x) are arbitrary 
functions. Conversely, if P and S are given by (9.40) and (9.42) with F additive in 
the second variable, then (9.36) holds. 


The converse part can be obtained by easy computation. 


Remark (iii). If we take P to be monotonic in the second variable, then we see that 
A,x(u) is continuous and, for Ay # 0, Ax(u) = f(x)u for arbitrary f(x). Hence 
P(x,u) = C(x) + B(x)u and Sx, uv) = a(x) + u B(x). 


Remark (iv). Instead of the additive group of real numbers, we can take an arbitrary 
group and consider the Pexider type equation on this group. The general solution of 
(9.36) is given by (9.39a) and hence by (9.40) and (9.42). But the constant functions 
c(x), b(x), and d(x) should be written in a proper way. 


9.6.2 Moufang and Extra Loops 


Let G(-) be a loop. Then it is known that the identities (9.6), (9.6a), (9.6b), and (9.6c) 
are equivalent. 

A loop G(-) satisfying any one (and hence all) of these identities is called a 
Moufang loop (Bruck [135, p. 115]). Corresponding to the well-known identities 
above, the identical relations 


(9.43) (yx -z)-Ax =y-(x- (z-Ax)), 
(9.43a) xy +(z- Ax) = (x- yz)- Ax, 
(9.43b) xy «(z+ Ax) =x-+(yz- Ax), 
(9.43c) (xy-Ax)z=x-(y- (Ax -z)) 


are considered, where 2 : G > G is any mapping. 

We prove the following results regarding these equations. In what follows, we 
make use of the following fact about inverse property loops G(-): If (U, V, W) is 
an autotopism of G, then (W, JV J, U) and (JUJ, W, V), where J is the inverse 
mapping, are also autotopisms of G [135, p. 112]. 


Result 9.15. (Kannappan and Taylor [522]). If (9.42) holds in a loop G(-), then G(-) 
is Moufang and (9.43a), (9.43b), and (9.43c) also hold in G(-). Further, ux € N 
(nucleus), where «4: G — G is a mapping such that x - ux = 1x. Conversely, if 
G(.) is a Moufang loop with ux € N for all x € G, then the identity (9.43) and 
(9.43a), (9.43b), and (9.43c) all hold in G. 
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9.6.3 Extra Loop 


In a loop G(-), the following identities are equivalent: 


(9.8) yx + 2x = (y+ xz)-x, 
(9.8a) (xy -z)-x =x-(y- 2x), 
(9.8b) xy -xZ=xX-(yx-Z). 


A loop G(-) satisfying any one (and hence all) of these identities is called an extra 
loop (Fenyves [270]). 

Let a : G > G be any mapping. Let us consider the following identities in G(-) 
similar to the identities above: 


(9.44) yx -(Z-ax) = (y-xz)-ax, 
(9.44a) (xy -z)-ax =x-(y-(@-ax)), 
(9.44b) xy-(ax+z)=x-((y-ax)-Z). 


Result 9.16. [522]. Let G(-) be a loop and let a : G — G be any mapping such 
that the identity (9.44) holds in G(-). Then G(-) is Moufang and (9.44a) and (9.44b) 
also hold in G(-). Also ux € N (nucleus), where u : G — G is any mapping such 
that ux = ax -x. Conversely, if G(-) is Moufang and ux € N, then (9.44), and so 
(9.44a) and (9.44b) also are satisfied in G(-). 


9.6.4 Characterizations 


We give characterizations for Moufang loops, extra loops, and groups with the use 
of the results above. 


Result 9.17. [522]. Let G(-) be a loop and i, u,a:G => G satisfy Ax =x + ux = 
x + (ax +x). Then the following conditions are equivalent: 


(a) Identity (9.43), (9.43a), or (9.43c) is satisfied for allx,y,z €G. 

(b) G(-) is Moufang, and ux € N (nucleus) for all x € G. 

(c) Identity (9.44) or (9.44a) holds for all x, y,z € G. 

9.6.5 Characterization of Moufang Loops 

Let G(-) be a loop. Then 

G(-) is Moufang © yx -zx7! = (y-xz)x7! hold in G(-) 

& (xy-z)xb =x-(y-(¢-x7})) hold in G(-) 
> xy+(x7!z) =x-((y-x7!)z)_ hold in G(-). 


9.6.6 Extra Loops 


Let G(-) be a loop. Then G(-) is an extra loop & G(-) is Moufang andx* ENS. 
Identity (9.43), (9.43a), or (9.43b) holds with Ax = x?. 
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9.6.7 Groups 


Let G(-) be a loop. Then G(-) is a group + . Identity (9.43), (9.43a), or (9.43b) 
holds with Ax = x?. 

The Moufang identities (9.6), (9.6a), (9.6b), and (9.6c) induce generalized 
Moufang identities. Here we consider the Moufang functional equation 
(9.45) Ci (x, Co(y, C3(x, z))) = Ca(Cs(Co(, y), y), z 


on GD-groupoids ($1, 55, 5; C1), (S2, S4, 55; C2), ($1, $3, Sa; C3), ($1, 53, 5; C4), 
(S6, 51, 57; Cs), (S1, $2, S6; Co). The solution is obtained by reducing it to a simpler 
functional equation 


(9.45a) C(x,o0- P(x,v’)) + C(x, 0+ wx)-0’" 


(-) being a loop operation and C and P right G-quasigroup operations (see Theo- 
rem 9.14). 


9.7 GD-groupoid 


(S1, S2, 53; a) is said to be a GD-groupoid, where S; (i = 1, 2,3) are distinct non- 
empty sets and a : S; x S2 — $3 is onto (surjective) such that the translations 
La(a)y = a(a, y) and R,(b)x = a(x, b) are onto for every fixed a € S}, b € So. 

If the mappings L, (a) and R,(a) are also 1-1 for every a € Sj andb € Sz, we 
obtain a right and left G-quasigroup, respectively. If a GD-groupoid is both a left 
and a right G-groupoid, then it is called a G-quasigroup. 


Result 9.18. (Pavlovi¢ [665]). All solutions of the functional equation (9.45) with the 
assumption that C, and C2 are right G-quasigroups and C4 is a left G-quasigroup 
are given by 


Ci(x,0) = y C(x, dv), Cr(y,u) = ay- pu, C3(x,z) = B' P(x, 62), 
Ca(r,z) = (0 'y $2), C5(Co(x, y), x) = OC(x, ay - yx), 


where the mappings B : S4 > Ss, y : Ss > S, 6: Ss — S7 are 1-1 and onto the 
mappings a: So > S5, 6: S3 > Ss, w: S$; — Ss are onto, and the loop S5(-) and 
the right G-quasigroups (S,, Ss, Ss; C) and (S,, S5, 85; P) satisfy equation (9.45a). 


Similarly, functional equations induced by extra l.c. identities can be considered. 


9.8 More Identities 


9.8.1 Entropic, Bisymmetric, or Mediality Identity 
A groupoid G is said to be entropic provided 


(9.46) ab-cd =ac-bd_ holds foralla,b,c,d € G. 
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Obviously, every Abelian group is an entropic groupoid. We will consider the 
converse. 


Theorem 9.19. (Morgado [639]). Let G be an entropic groupoid. Then G(-) is an 
iso-Abelian group if and only if 


(9.47) a=b-(ba-cc), fora,b,c€G, 


holds. 


Proof. We present a simple, direct, shorter proof different from that in [639]. 

Suppose (9.47) holds. (-) is l.c. Let ba = bd. Replacing a by d in (9.47), we 
get a =d. Applying l.c. to (9.47) results in cc = constant = e, say. Nowa = b in 
(9.47) gives b = be. Then 


XZ=XZ-C=XZ-yy=Xy-zy 


(the only place where an entropic groupoid is used), which is transitivity (9.3). 
We will now show that the 7’-condition holds. 

Suppose X1y¥2 = X2Y1 and X13 = X3Y1, X1,X2,X3, V1, V2, V3 € G. Then, using 
(9.46), xoy1 + X31 = X1y2 + X13 implies x2x3 = e€ - y2y3 = y3y3 + y2¥3 = y3y2. 
Finally, 


(9.46) 
X2V3Z° = X2N3* V3X3 = XZ V2 * V3XZ3 = C+ V2XZ3 = XZX3 * V2X3 = X32. 


Thus the T-condition holds. Hence G(-) is isotopic to an Abelian group G(o). 
The isotopy may be given by 
aob=a-eb. 


This completes the proof. 


Remark 9.19a. Suppose a = (cc - ab) - b holds instead of (9.47). Then define 
axb=b-a.Then 


a=bx(cc-ab) =b*x (ab *cc) =b x ((b* a) * (c *Cc)), 
which is (9.47). 
Theorem 9.19b. Instead of (9.47) in Theorem 9.19, suppose 
(9.47b) a=b-(cc-ab), fora,b,céG, 
holds. 


Proof. Suppose ab = db. Equation (9.47b) imples rc. Let ba = bd. Equa- 
tion (9.47b) yields 


a=b-(ca-cb) (by (9.46)) 
=b-(ba-bb), 
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but 
d=b-(bd-db), 


which shows that a = d; that is, l.c. Use Lc. and rc. to have cc = constant = e, and 
b = a in (9.47b) shows that a = ae. 

yZ = yZ-xx = yx - zx, which is transitivity. As in Theorem 9.19, it can be 
shown that the T-condition holds. Thus G(-) is an iso-Abelian group. 


Result 9.20. (Bednarek and Wallace [109]). Suppose G is a Hausdorff groupoid in 
which (x, y) > xy is continuous and xy - yz = xz holds. Then 


(i) mediality (9.47) holds; 
(ii) S? = S £ d, where S = the set of all idempotents in G; 
(ili) ee SS xe=xe-eandey =e -ey; 
(Gv) GS = SG = S$’; 
(v) e € S > Ge is a subsemigroup of S. Moreover, Ge = Ge- Ge and x — xe is 
a retractioning mapping (continuous homomorphism f such that f* = f); 
(vi) G is a semigroup; 
(vii) the minimal ideal of G is G* = (SN Ge) - (eSe)- (eG NS); 
(vill) suppose xy - yz = xz is implied by y- (xy-zx) = yz. Moreover, if G is a com- 
pact connected Hausdorff space, then the mapping (x, y) > xy is constant. 


9.9 Left Inverse Property (l.i.p.), Crossed-Inverse (c.i.), and 
Weak Inverse Property (w.i.p.) Loops 


We now give characterizations for (1) groupoids (loops) with identities satisfying 
li.p., (ii) the c.i. property, and (iii) w.i.p. as subvarieties of groupoids satisfying 
certain conditions (single identities). 


9.9.1 1.i.p. Loops 


A groupoid G(-) is said to have the left inverse property if, for every x € G, there is 
at least one a € G such that a - xy = y forall y € G. Consequently, the equation 
cx = d is solvable for x. Similarly, we can define right inverse property in a groupoid 
that implies the solvability of yc = d. If G(-) satisfies both the .i.p. and ri.p., then 
G is said to have the inverse property and in this case G(-) is a quasigroup (Bruck 
[135, p. 111]). 

Let G(-) be a groupoid. G(-) is an iso-l.i-p. (i.p.) groupoid (or loop) provided 
there is an 1.i-p. (1.i.p.) groupoid (or loop) G(«) with unity that is a principal isotope 
of G(-) such that «* and - are connected by the relation x*y = x-A(y) (x*y = p(x)-y) 
for all x, y € G, where A(p) is the left (right) inverse operator of G(x). 


Result 9.21. (Kannappan [434]). A necessary and sufficient condition that a 
groupoid G(-) is an iso-l.i.p. groupoid is that the identity 
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ov - [{(tt)x - (uu)(xy)} - (ww)] = y 


or 


[vo - {(t1)x - (uu)ry)}] - (ww) = y 


holds for all x, y,u,v, w,t €G. 


9.9.2 c.i. Loops 


Loops G(-) in which the equivalent identities xy - p(x) = y or A(x) - yx = y hold 
for all x, y € G are called crossed-inverse (c.1.) loops, where p(A) is the right (left) 
inverse operator [72, 135]. 

Let G(-) be a groupoid. Then G(-) is called an iso-c.i. loop if there is a c.i. 
loop G(x) that is a principal isotope of G(-) such that - and * are related either by 
xX *y = p(x): yorbyx*xy=x-A(y). 


Result 9.22. [434]. Let G(-) be a right quasigroup. Then G(.) is an iso-c.i. loop 
provided 

X-yx = y- (tt) 
holds for allx,y,t € G. 


9.9.3 w.i.p. Loops 


Let G(-) be a loop with identity 1. Then G is said to be a loop with weak inverse 
properties if whenever three elements x, y, z € G satisfy the relation xy-z = 1, they 
also satisfy the relation x-yz = 1. The properties y-p(xy) = p(x) or A(xy)-x = A(y) 
are equivalent to the definition of w.i.p. loop above (Basarab [104]). 

Let G(-) be a groupoid. We say that G(-) is an iso-w.i.p. loop provided there is 
a w.i.p. loop G(«) that is a principal isotope of G(-) such that + and - are connected 
either by x -y = A(x) * yorbyx-y=xx* p(y). 


Result 9.23. [589]. A necessary and sufficient condition that a left quasigroup G(-) 
is an iso-w.i.p. loop is that 


[(¢t) - x] - (cx) = (uu) -z 


holds for all x,z,t,u € G. 


9.10 Steiner Loops 


Mendelsohn [634] has defined the concept of a generalized triple system as follows. 
Let S be a set of v elements. Let T be a collection of b subsets of S, each of 
which contains three elements arranged cyclically and such that any ordered pair of 
elements of S appears in exactly one cyclic triplet (note the cyclic triplet {a, b, c} 
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contains the ordered pairs ab, bc, ca but not ba, cb, ac). When such a configura- 
tion exists, we will refer to it as a generalized triple system. If we ignore the cyclic 
order of the triples, the generalized triple system is a B.I.B.D. 

There is a one-to-one correspondence between generalized triple systems of 
order v and quasigroups of order v satisfying the identities x7 = x, (xy)x = 
x(yx) = y. The term generalized Steiner quasigroup means a quasigroup that satis- 
fies the identities above. 

Let G be a generalized Steiner quasigroup of order v. From G, a loop G* with 
operator * is constructed as follows. The elements of G* are the same as those 
of G together with an extra element e. Multiplication in G* is defined as follows: 
axe=exa=a; axa =e;andfora,b € Gwitha 4 b, defineaxb=a-b. 
It follows easily that G* is a loop with identities x *e =exx =x, xX*xX = 
e, x *(y *x) = (* * y) *x = y. Also, the correspondence between generalized 
Steiner quasigroups and generalized Steiner loops is a bijection. 

A loop that satisfies the identities 


(9.48) xXx =e, xe=xX=ex, X-yx=y=xy-x, forx,yEG, 


is called a generalized Steiner loop (g.s.1.). In [640] is given the identity (9.49) 
characterizing g.s.l. Five equivalent identities were found immediately afterwards 
in 1970 to characterize g.s.l.s. Now the time has come to put them in the following 
theorem. 


Theorem 9.24. A groupoid G(-) is a generalized Steiner loop if and only if G satis- 
fies any one of the identities 


(9.49) a-[((bb)-c)-a]=c, 
(9.491) [a-c(bb)]-a=c, 
(9.4911) a-(ca-bb)=c, 
(9.49111) (a-ca)-bb=c, 
(9.49iv) bb-(a-ca)=c, 
(9.49v) (bb-a)-(ca-dd)=c, 


fora,b,c,d €G. 


Proof. First we consider (9.49), treated in [640]. Here we present a different, simpler 
proof. 
In (9.49), replace c by (dd - k) - bb and use (9.49) to get 


a-ka = (dd-k)-bb, 
and 
(9.50) bb-(a-ka)=k, fora,b,k €G. 


Suppose ka = ua. Then (9.50) shows that k = uw; that is, (-) is nc. Apply rc. in 
(9.50) to obtain bb = constant = e (say). Then (9.49) becomes 


a-(ec-a)=c. 


Put c = e to obtaina -ea = e = ea - ea, implying ea = a.So,a-ca=c. 
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First, a = e in (9.49) yields ce = c, and replacing a by ac gives ac: (c-ac) =c; 
that is, ac -a = c. This proves it is a g.s.1. 
Now we take up (9.491): 


(9.491) [a-c(bb)|-a=c fora,b,c eG. 
Put c = dd - bb - (c - dd) in (9.491) to have 


ac-a=bb:-(c-dd), 
(9.501) (ac-a)-bb=c, fora,b,ceEG. 


Now ac = ad in (9.501) gives c = d and l.c. and 
(ac-a)-bb = (ac-a)-dd. 


l.c. implies bb = constant = e and (ac -a)-e = c. Nowc = e gives (ae-a)-e = 
e = (ae-a)-(ae-a); that is, ae-a = e = ae-ae, implying ae = a. Equation (9.491) 
yields ac-a=c. 

First a = e gives ec = c, and replacing a by ca shows that (ca - c)- ca = c or 
a-ca =c. This proves it is a g.s.1. 

Next we consider 


(9.4911) a-(ca-bb)=c_ fora,b,ceéG. 


ca = da in (9.49ii) yields c = d and thereby r.c. Suppose ca = cd. Equation (9.4911) 

is d- (cd - bb) = c, which by using r.c. gives a = d; that is, lc. Apply l.c. to 

a:-(ca-bb) =c =a-(ca-dd) to get bb = e. Now (9.49ii) becomes a- (ca-e) =c. 

As before, c = a shows that ae = a anda-ca = c. As before, a = e gives ec =c 

and changing a to ac gives ac: (c- ac) =c orac:a=c. This proves it is a g.s.l. 
Now we take up 


(9.49iii) (ac-a)-bb=c. 


Now ac = ad in (9.49i1i) gives c = d and |.c. Then (9.49iii1) yields bb = constant = 

e and (ac-a)-e = c. First, as before, c = e gives (ae-a)-e = e = (ae-a)-(ae-a); that 

is, ae-a = e. As in the proof of (9.491), we obtainae = a = eaandac-a =c =a-ca. 
Next in line is 


(9.49iv) bb-(a-ca)=c_ fora,b,c EG. 


First ca = da gives c = d andr.c. and then bb = e ande- (a- ca) = c. Nowc =e 
yields a-ea = e = ea -ea, implying ea = a anda-ca = c. Use previous arguments 
to getade =a, ac-a=c. 

Finally, we consider 


(9.49v) (bb-a)(ca-dd)=c_ fora,b,c,d€G. 


First ca = ua shows c = u andr.c. and then bb = e. Now ea: (ca-e) = c. Seta =c 
to have ec-e = c. Putc =e to get ea: (ea-e) = e orea-a =e. Hence ea = a. So, 
ce =canda-ca = a. Finally, a to ac yields ac - (c- ac) = a orac- a =a. This 
proves it is a g.s.l. 
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9.11 Bol Loop and Power Associativity 


It is known that a right (left) Bol loop is power associative. Here we prove that the 
right Bol loop satisfying (9.5’) is power associative by using the hexagonal closure 
condition. 

Suppose (9.5’) holds. Put y = x7! in (9.5’) to obtain 


(@x)-x") -x = z- (ax! -x) = zx, 


' — ¢; that is, the right inverse property holds in G. Suppose 


implying zx - x 
(9.51) 9 x1y2=x2y1, X13 =xX2yV2= X31, forxj, yi € G @ = 1, 2, 3). 
Use (9.51) and the right inverse property to have 

xX, = Xx2y2- ae 


_ (9.5’) = 
xiy2 = (xay2-yy!)+ yo = x2- (yayz' + y2) 
= *2 91; 


that is, y2y3 -y2 = y;. Now 


x3y1 = x3 - (293! + y2) 
= (x3y2-y3')-y2 by 9.5’) 
= X2)2; 


so that x2 = x3y2- ye i that is, x2y3 = x3y2. Hence the H-condition holds. Thus a 
right Bol loop is power associative. 


9.12 More Functional Equations 


We consider the generalized equations of associativity, transitivity, and bisymmetry 


(9.52) (xly)2z = x3(y4z), 
(9.53) (x1z)2(y3z) = x4y, 
(9.54) (x Ly)2(z3u) = (x4z)5(y6u), 


where 1, 2, 3, 4, 5, 6 are binary operations and x, y, z, u belong to certain sets. 
First we consider the generalized associativity (9.52) on the quasigroups G(i), i = 
1,2,3,4. 


Theorem 9.25. If (9.52) holds for all x, y,z € G and the G(i) (i = 1, 2,3, 4) are 


quasigroups, then each G(i) (i = 1 to 4) is isotopic to the same group. 
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Proof. (See also Taylor [802].) We will prove that the R-condition holds. Suppose 


(9.52a) x14y2 = x04y1,  xX14y4 = x24y3, and x34y2 = x44y}. 


y4 
3 


y2 


y1 


X1 X2 X3° X4 


Sety =x2, z=ypandy=x1, Z=y2,;y=x2,2 = yz3 andy = x1, Z= y4; 
and y = x3, Z = y2 and y = x4, z = yj Separately in (9.52) and use (9.52a) to get 


(9.55) (x1x1)2y2 = @lx2)2y1, 
(9.56) (x 1x2)2y3 = (wlx1)2ya, 
(9.57) (x1x3)2y2 = (x1x4)2y1. 


Choose z1, z2 € G such that z; 1x3 = z21x; (note that G(1) is a quasigroup). 
Put x = z2 in (9.56) to get 


(9.56a) (z21x2)2y3 = (z21x1)2y4. 
Let x = z, in (9.57) and x = z2 in (9.55) to have 


(z11x3)2y2 = (z11%4)2y1; 


that is, 

(z21x1)2y2 = (z1x4)2y1 
and 

(zo1x1)2y2 = (z21x2)2y1. 
Thus 

(z11x4)2y] = (Z21x2)2y1 
or 


zylxq = z21Xx2. 


Now (9.56a) becomes 


(zi 1x4)2y3 = (221x1)2y4 = (211x3)2 ya; 
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that is, by (9.52), 

z13(x44y3) = 213 (x34 ya). 
Since G(1) is a quasigroup, x44y3 = x34y4; that is, the R-condition holds in G(4). 
So G(4) is isotopic to a group. Similarly, that G(@) (i = 1, 2,3) are isotopic to a 
group can be proved. This completes the proof. 
Result 9.25a. [802]. Generalized equation of transitivity. If (9.53) holds for x, y, z 
in G and G(i) (i = | to 4) are quasigroups, then each G(i) is isotopic to the same 
group. 
Result 9.25b. [802]. If (9.54) holds for all x, y,z,u € G and each G(i) Gi = 
1,2,..., 6) is a quasigroup, then every G(i) is isotopic to the same Abelian group. 


9.13 Generalized Groupoids 


Now we consider the generalized equations of associativity and bisymmetry on 
G D-groupoids; that is, on different sets. A generalized groupoid (X, Y, V,i) will 
be denoted by “7”. 

Let (X,Y, V, 1), (V, W, S,2), (X, U, S, 3), and (Y, W, U, 4) be GD-groupoids. 

An element b € Y is a right generator of the generalized groupoid (X, Y, V, i) 
if Xib = {xib|x Ee XJ=V. 

Anelementc € V isa right solution element of i if xiy = c has a solution y € Y 
forall x € X. 


9.13.1 Generalized Associativity 


Result 9.25c. (See Taylor [803].) Let 1, 2, 3, and 4 be generalized groupoids that 
satisfy (9.52). If 1 has a left generator, 2 is right cancellative, 3 is left cancellative 
and has a left solution element, and 4 has a right solution element, then the solution 
of the generalized associativity equation is given by 


xly=$'(f(x)g(v)), 022 = d(v)h(2), 
x3u = f(x)yu), y4z = w'(g(y)h(z)), 


where (S,-) is a group, f : X > S, g: Y > S, h: Z = S are surjections, and 
d:V—>S, w:U = Sare bijections (see also [113)). 


9.13.2 Generalized Bisymmetry 
Let (X,Y,H,1), (H,S,G,2), (W,U,S,3), (X,W,T,4), (7,V,G,5), and 
(Y, U, V, 6) be generalized groupoids that satisfy (9.54). 


Result 9.25d. [803]. Let 1, 2, 3, 4, 5, and 6 be generalized groupoids that satisfy 
(9.54). If 1, 4, and 6 each have both a right solution element and a right generator 
and if 2 and 5 are cancellative, then there exists an Abelian group G(+), surjections 
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g:X ~G,f:YoG,h:wW->G,k:U — G, and bijections a: H > 
G, B:S—>G, y :T > G, 6: V = G that satisfy 
xly=a'(f(x)+8(y)),  r2s=a(r)+ BGs), 
z3u=f'(h(z)+ku)), x4z=y7'(Fx) +h), 
tSv = y(t) + dv), you = 0! (g(y) + k(w)). 


10 


Functional Equations from Information Theory 


In this chapter, information theory using desirable properties and representation, 
Shannon’s entropy, directed divergence, generalized directed divergence, entropy of 
order a and degree f, and weighted entropy are treated. The Fundamental equations 
of entropy and their generalizations and sum form equations and their generalizations 
are treated. Distance measures and inset measures are performed, and applications 
are treated. 


10.1 Introduction 


In this chapter, we briefly consider various measures of information and their salient 
properties and bring out their natural connections to many functional equations and 
their characterizations. It is not an exaggeration to say that the concept of informa- 
tion has played an important role from its beginning since it has so many aspects 
and applications in various fields. A fundamental question in information theory 
is how to measure the amount of information, in other words, how to define infor- 
mation. Several approaches are possible. One is a pragmatic approach that starts 
from certain particular problems of information theory and accepts as measures of 
the amount of information the quantities which present themselves in the solution. 
Another approach may be described as the axiomatic or postulational approach in 
which one starts with certain properties that a reasonable measure of information 
has to possess, and after this it is a purely mathematical question of determining all 
the expressions that possess these postulates. This is the point of view adopted by 
C.E. Shannon in 1948 in his now famous fundamental paper “A mathematical theory 
of communication”. This led to the study and application of functional equations in 
information theory. These two points of view are according to the opinion of Rényi 
[699]—they are not as opposed to each other as they seem to be but rather are com- 
patible and even complement each other. As our interest is focussed on functional 
equations, this chapter follows the point of view of the postulational approach. 
Information is what is provided by a descriptive statement that affects our knowl- 
edge; that is, it is the information content of the statement. “Information” is also 
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known as uncertainty. When one faces an alternative, one finds oneself in a state 
of uncertainty. Any process of knowledge or decision entails the transition from 
one state of uncertainty to another. This cannot take place without losing or gaining 
something. The term involved in this process is what is called information. The end 
result is information gain. Note that we can interpret a decrease or loss of infor- 
mation as an increase or gain of information. This was the reason for saying that 
a measure of the amount of uncertainty is the same as a measure of the amount of 
information. 


There is another word that is associated with information, which is entropy, sug- 
gested by von Neumann and (originating with Boltzmann, Gibbs, and Shannon) in- 
troduced in physics. The entropy of a probability distribution can be interpreted not 
only as a measure of uncertainty but also as a measure of information. As a matter of 
fact, the amount of information that we get when we observe the result of an exper- 
iment can be taken to be numerically equal to the amount of uncertainty concerning 
the outcome of the experiment before carrying it out. We shall use the word entropy 
to indicate a measure of uncertainty. 

Since the advent of the notion of information, due originally to Hartley, it has 
been much developed by Shannon, and it is with Shannon’s name that it is associ- 
ated, as Shannon entropy. Since then, quite a large number of papers have appeared 
on the subject. It was developed by many authors, including Feinstein, Kullback, 
Rényi, Wiener, and others, and was also advanced into the ergodic theory by Gelfand, 
Khinchin, Kolmogoroff, Yaglon, and others and applied to investigate the theory of 
transformation with invariant measures by Kolmogoroff and others. The study of 
information theory, broadly speaking, includes entropy theory, coding theory, modu- 
lation theory, noise reduction theory, theory of channels, and applications in ecology, 
psychology, detective work in tracking down criminals, forecasting theory, statistical 
mechanics and thermodynamics, statistics, marketing, economics, image process- 
ing, etc. Mathematicians, physicists, electrical engineers, physiologists, phoneticists, 
psychologists, statisticians, and others have all contributed to this area. 

The first definition of the notion of entropy was given in its generality by Shan- 
non, well known as Shannon’s entropy (SE) [732], as 


n 
(SE) H,(P) = — >> pilog pi, 


i=1 


n 
where P = (pi, p2,..-,Pn)) 0 < pi < 1, D pi = 1, n = 2, 0 € Zt, has 
i=1 
been deduced from various statements of axioms. Hartley’s entropy is a measure 
of uncertainty with a minimum amount of advance information. More precisely, it 
is a measure of the uncertainty one has about the outcome of the experiment when 
the only relevant information is the total number of possible outcomes n. Hartley 
introduced the measure [344] 


H;*(P) = log{number of positive p;’s} = log N(P), 
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where P = (p1,...,pn), 09 < pi < 1, dpi = 1; that is, the measure 
depends only on the number of nonzeros in P, and N(P) is the number of nonzeros 
among P1, P2,..-, Pn (that is, the number of events in an experiment with probabil- 
ity greater than 0). 

A key feature of Shannon’s information theory is the discovery that the collo- 
quial term information can often be given a mathematical meaning as a numerically 
measurable quantity, on the basis of a probabilistic model, in such a way that the 
solution of many important problems of information storage and transmission can be 
formulated in terms of this measure of the amount of information. This information 
measure has a very concrete operational interpretation: roughly, it equals the mini- 
mum number of binary digits needed, on average, to encode the message in question. 
The coding theorems of information theory provide such overwhelming evidence for 
the adequacy of Shannon’s information measure that to look for essentially different 
measures of information might appear to make no sense at all. Moreover, it has been 
shown by several authors, starting with Shannon, that the measure of the amount of 
information is uniquely determined by some rather natural postulates. 

The aim of this chapter is to characterize entropy and other measures of infor- 
mation uniquely by means of physically plausible properties; that is, axiomatic char- 
acterization of measures of information from the functional equation point of view, 
following Shannon’s lead. The most common approach is based on some probability 
theory that includes dependence on the outcomes of an experiment and the corre- 
sponding probabilities. The idea is to define an information measure by a set of 
intuitively natural properties leading to a system of functional equations and func- 
tional inequalities, the solution of which gives rise to characterization of entropies. 
This axiomatic procedure advanced by Shannon is probably the most effective and is 
a most useful one for practical applications. The literature in this field is voluminous. 
A partial list is [43, 699, 700, 732, 698, 97, 283, 48, 804, 569, 570, 344, 238, 239, 
242,571, 245,57, 164]. 
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n 
Let Pp = {P = (pi, p2,---, Pn)s Pi = O with S” pj = 1} be the set of all com- 
i=l 


plete, discrete, nonnegative probability distributions of length n (n > 2); T . = 


n 

{P = (pi, p2,---,Pn)i pi > O and >> p; = 1} be the interior of T, (n 
i=1 

2); An =T, or 9; J = [0,1], Jo = ]0,1[, 4) = J or Ip; 0- logO = 0; 0 


ne 


0; 0- log 8 = 0; logarithms are to base 2; 


IV 


(SF) s(x) =—xlogx — (1 — x) log( — x) for x € J;, Shannon function, 
(10.1) s@,y)=-—xlogy—(1—x)logd—y) forx,yeh, 


(10.1a) s(x) = sa +(1—x-1) forxel, B40, 
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(10.1b) sp(x) =xL(x) +0 —x)LU — x), 
s(x, y,z) = x(a, logx + b; log y + cy logz) 


(10.1c) 
+ y(a log y + b2 log z) + c2z log z 


for P € An, Q € Am define the product P * OQ = (pigj) € Am (i = 
1,2,...,n, 7 = 1,2,...,m); and a sequence of map My : Ak >R(a>2,keE 
ysis k > 1) is called an information measure. 


10.2.1 Properties, Postulates, and Axioms 


Shannon emphasized that “the hard core of information theory is essentially a branch 
of mathematics, a strictly deductive system; a thorough understanding of the math- 
ematical foundations and its communication application is surely a prerequisite to 
other applications”. Many of the inventions deal with the storage, transmission, 
transformation, and retrieval of information. Information occurs in various forms: 
oral, written, visual, electronic, mechanical, etc. From a mathematical point of view, 
the essence of information is its quantity, and the basic problem is how to measure it. 
As mentioned earlier, we adopt the axiomatic (mathematical) approach of Shannon. 
Axiomatic characterization of measures of information in general and Shannon’s 
entropy in particular is done by judiciously choosing the properties (algebraic and 
analytic) satisfied by them. These lead to the study of many functional equations. 
Apart from Cauchy functional equations, which occur in various characterizations, 
and numerous other functional equations, there are two categories of functional equa- 
tions that play a prominent role. One is known as the fundamental equation of infor- 
mation theory, 


(FEI) roy ede (Fe -)= fort d= (; =). 


for x, y € [0, |[ with x + y € J, and its generalizations, and the other is referred to 
as the sum form equation, 


n m m 


(SFE) > > fGiys) = y sev + ZA 


i=1 j=1 


for (x;) € An, (vj) € Am, and its variations. These equations will be studied in the 
subsequent sections, pointing out the close connection between these two types of 
functional equations. 


10.2.2 Desirable Properties—Postulates 


Algebraic Properties 
n-symmetry 


Hn(P1, Po,..., Px) = Un(Paa), Pa(2)s+++> Pack), where Pj € An G = ee 
k) and a is a permutation on {1,2,..., k}. 
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Expansibility (k = 1) 


Ln+1(P15 P2s-++> Pn» 9) = Mn(P1, P2;---, Pn) for all (p;) € Ty; that is, adding an 
outcome with zero probability does not change the amount of information expected. 


Additivities 
(m, n)-Additive, Subadditive, and Strong Additive 


Consider an experiment P with the possible outcomes c; (i = 1, 2,...,) of prob- 
abilities pj = p(cj) and another experiment Q with possible outcomes ci = 
1,2,...,m) of probabilities gj = = plc; ); denote by P * Q the sombination of the 
two einen with c; 9 c. ,G@ =1, 2,. ,n; j = 1,2,...,m) as possible out- 
comes. It is natural to Speck when the two > ekpeAINeAts are sindependent me is, 
every outcome c; of the first experiment is independent of every outcome c’, f of the 
second experiment) that the information expected from the combination of the two 
experiments is equal to the sum of the information expected from the individual 
experiments. That is, 


Hn(PO « QOPA « QO]... PO « 9) 
= tm(Q™ 10 jo), ig PON PN PO) 


for PY PO)... PO © Ay, 08, 0, ...,0 © Ay PHO S Rie: 
This is called (m, n)-additive. 

In general, we cannot expect more information from a combination of two 
experiments than the sum of the information that can be obtained from the individual 
experiments. For k = 1, this is called subadditive and is expressed by 


Hmn(P * Q) < un(P) + um(Q) 
for P € An, QO € Am. 
(m, n)-Strong Additivity 


Lmn(P1411, P1912,+++5 PIQins P2921, +++» P2G2n,-++5Pm|ml1,---+> Pm4mn) 


m 


= Um(P1, P2; 25 Pm) + >) Pj la Gj1s 4j25 wes jn) 
j=l 


for (pi) € Am, (Gjk)gay € An» f =1,2,...,m 
n-Recursivity or Branching 
Pil, P125+++5 Pin Pil + P12, P13,-++5 Pin Pil, P12 
P21 P225+++5 P2n P21 + p22; P235+++5 P2n P21, P22 
Ln . = Hn-1 . +n-1 . > 


Pki, Pk25+++5 Pkn Pki + Pk2; Pk35+++5 Pkn Pki, Pk2 
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Pi = (pil, Pi2;---, Pin) € An @ = 1,2,...,k), and some function ¢y_1 : i > 
R. 
In a special case (k = 1), the sequence {/y} is said to be recursive provided 


Ln(P1, P2s+++> Pn) = Hn—-\(P1 + P2, P35-++5 Pn) + gn—1(P1, p2) 


for all P = (pi, p2,-.--, Pn) € An and for some function ¢,_1 : i —-R. 

A special form of ¢,—; used in many characterizations is ¢)—1(p1, p2) = (pi + 
P2) 2 Ca — with p; + p2 > 0. 

Suppose we want to know the outcome of a given experiment P with outcomes 
C1,€2,...,Cn Of probabilities pj, p2,..., Pn. Alternatively, we could perform the 
experiment Q with outcomes cj Uc2, c3, ..., Cn and corresponding probabilities p;+ 
P2, P3,++-» Pn. This may result in some loss of information. The basic assumption 
underlying the recursivity is that the amount of information that is lost by observing 
the outcome of Q instead of P depends only on the probabilities p; and p2 of the 


respective events c; and c2. 


Normality 


In the case k = 1, 2 (5. s) ie 


Representation 
Sum Property 


A sequence of measures Ly : Ne — R (k = 1) is said to have the sum property 
provided there exists a function f : J;' > R such that 


n 
Hn(Pil|Pall == Pk) = 0 f(Pu, pais +s Pei) 


i=1 


for all Pj = (pji, Pj2,--+»Pjn) € An G =1,2,...,k) and for all n > 2, and the 
function f is called a generating function of the sequence {Un}. 
When k = 1, the representation is 


n 


Un(P) = >° f (pi) for P € An, 


i=1 
and when k = 2, the sum property is 
n 
Un(P\|O) = >_ f(pisgi) for P,Q € An. 
i=1 


Quasilinearity 


For the case k = 1, there exists a continuous, strictly monotonic function y : Ry > 
IR such that 
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n 
un(P) = Ww! (| > pjw(—log pj) } for pj > 0 
j=l 


for P € T°. 

Regularity 

We illustrate regularity for the case k = 1. &n(p1, p2,---, Pn) is continuous on Ay. 
Define 

(10.2) f(p) = #2(p,1- p) forpen. 


f is called the Shannon information function. Some of the regularity properties used 


e f is Lebesgue integrable on /,, 

e ff is measurable, 

e f is continuous at 0, 

e f is continuous on J), 

e ne LH2(p, | — p) = 0 (smallness for small probabilities), which gives the in- 
p> 


tuitive statement that we obtain very little information out of an experiment with 
two possible outcomes, one of which is almost certain and the other almost im- 
possible, 

f is increasing or decreasing on ]0, sL 

0 < f(p) <k for p € 1, k some constant, 

f is bounded on an interval or bounded on a set of positive measure, 

f is differentiable, etc. 


10.2.3 Characterization of Information Measures 


Some of the well-known measures of information are 


n 
(SE) H,(P) = —- > Di log pi (Shannon’s entropy), 
i=l 
(KD In(P||Q) = - ~ pi log gi (Kerridge’s inaccuracy), 
(dd) Dn (P||Q) = > pilog - (directed divergence), 
L 
di (generalized 
dd GD,(P R)= ; log — 
(gd) n(PILOIR) > Bee Tj directed divergence), 
1 n 
(10.3) HY (P) = i log (> r) (entropy of order a), 
-—a 
i=1 


a#l,a>od 
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1 n 
(10.4) HP (P) = 58 24] 3 pe - ) (entropy of degree or type f), 
i=1 


pH 
(10.5) H,(P, W) = — > w; pi log Di (weighted entropy), 


etc., for P,Q,R € An, W= (wj,..., Wn), wi = 0. 

There are many algebraic properties that are shared by these measures. It is easy 
to check that all these measures are symmetric; whereas the first four measures are 
additive and recursive, the last three are nonadditive; all the measures above have 
representation or the sum property except Rényi’s entropy of order a; the generating 
functions are f(p) = —plogp, g(p,q) = —plogg, or plog®, h(p,g.r) = 
p log Z, k(p, w) = —wp log p, f’(p) = c(p* — p), etc. There are properties that 
are special to individual measures. 

Now we deal with the axiomatic characterization of these measures through some 
of the properties they possess. We start off with the Shannon entropy. 


10.2.4 Shannon Entropy and Some of Its Generalizations 


Shannon’s entropy satisfies the inequality known as Shannon’s inequality, 
n n 
(SI) — > pilog pi < — >> pilogai. 
i=l i=1 


P = (pi), Q = (qi) € An, which will be dealt with in Chapter 16, on inequalities. 

We will show that whereas symmetry and recursivity lead to the fundamental 
equation of information (FEI), the additivity and the sum representation lead to the 
sum form functional equation (SFE). First we take up (FEI). 


10.2a Fundamental Equation of Information—Axiomatic Char- 
acterizations 


Several authors have given different sets of postulates to characterize Shannon’s en- 

tropy (SE). Let “, : An > R, n > 2, be a sequence of information measures. 
Shannon [732] used the postulates 2, is n-symmetric, fz, is continuous in each 
: 11 11 1 1 1 

variable, H2(5, z) = 1, Lnl>> morte =) < Lnti(aa7> Satay Pray and 


Ln(P1; P2,-+++> Pm—-1>, Pm1,-++> Pm4n—m+1) 


(10.6) 
= Lm (P1, P25+++5 Pm) Se PmLn—m+1(Q1; G25+++5 dn—m+1) 


to characterize (SE), whereas Kinchin [534] showed that n-symmetry, continu- 
ity in each variable of “,, strong additivity, expansibility, normality, nonnegativ- 
ity, Un(P) = 0, and wy(P) < nls, i, ee +) characterize (SE). But Faddeev 
((267, 266]; see also Rathie and Kannappan [688] and Aczél [13]) gave the sim- 
plest of postulates and proved the following theorem that is of basic importance in 
information theory. 
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Theorem 10.1. Suppose My on Ty is n-symmetric, f given by (10.2) is continuous 
for p € [0, 1], 25, 5) = |, and Uy is n-recursive, 


Ln(P1, P2.+-+> Pn) = Un-1(P1 + P2, P3,+++>s Pn) 


(10.7) Pi p2 
+ (pi + poe , 
Pit p2 pit p2 


for P = (pi) € Tp with py + p2 > 0. Then uy(P) = An(P). 


Proof (1). The proof given by Faddeev consists of the following steps. 
The function L : Zi — R defined by 


1 1 1 
L(n) = Ln “9 ~oeee9 7 Fo n>, 


non n 


(i) satisfies (L) (called a completely additive number-theoretical function), and 
Gi) lim (L(n+ 1) — L(n)) = 0. 
n—->Co 


(i) and (ii) together with L(2) = 1 imply L(n) = logn. Now claim jy, is expan- 
sible. Using symmetry and recursivity (10.7), for p € J, 


3(p,1— p,0) = wo, 0) + p2(p, 1 — p) 
—_ H2(p, 9, 1 ~ P) 
= H2(p,1— p)+ ppro(l,9). 


Thus w2(1, 0) = 0 and w3(p, 1 — p,0) = w2(p, 1 — p). Further, 


Hnt1(P1,-++5 Pn, 9) = nti (1,9, pr, ---s Pn) 
= Mn(P1, P2; oh ag: Pn) a piled, 0) 
= Mn(P1, P2; 9a: Pn); 


m 


that is, “, is expansible. jz, is also strongly additive (see [43, p. 62]). Letr = 7 
be a rational, m < n. Now applying (2, 7)-strong additivity using symmetry and 
expansibility, 


m 1 m 1 
Lon (1- =) +5 (1-=)-——, 0... 
n n= 1 n 1n-iror_—”” 
tH 


m times 
(n—m) times 


m i m 1 
—-—,...,—-—, 0,...,0 
n m n m 
———— 

: (n—m) times 
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n—-m 1-i-_— 
oO”? Ors times 
(n—m) times 


—__ : ; 
: (n—m) times 
m times 


Hence 


f(r) = 2 (—,1- =) 


n 


(- -) m (- ~) 
=Un | —,---5-~F — Hm 1 meee a 
n n n m m 
m 1 1 
= (i- “) Hn—-—m | —— + +> 
n n—-m n—-m 


m m 
= logn — — logm — (1 - “) log(n — m) 
n n 


=-rlogr —(1—r)log(l—r)_ for,r rational in /;. 


Since f is continuous, f(x) = s(x), the Shannon function (SF). 
Now apply recursivity (10.7) to obtain unp(P) = Hy(P). 


Proof (2). Now we present a simple proof of the theorem involving (FEI), but first 
we give the motivation of (FEI) [196]. 


10.2b The Fundamental Equation of Information Theory 


Let a be arandom event, and denote the probability of a by x = p(a). If we perform 
an experiment concerning a, then we denote the measure of information obtained by 
the experiment by (a). We suppose that the unknown quantity / (a) depends only 
on the probability x of a; that is, 


I(a) = f@), 


where f(x) is a real-valued function that is defined in the closed interval [0, 1]. Let 
f be another event such that af = 0 and let y = p(f) be its probability. After 
the experiment concerning a, we perform a second experiment concerning /. The 
second experiment yields the relative information /(/|a), and we define it as the 
information belonging to the probability 


10.2 Notation, Basic Notions, and Preliminaries 413 


y _ ptf) 
l-x  p@) 


taken with the weight | — x = p(@); that is, 


1(Bl@) =(1-x)f (4) : 
Having performed both experiments, we obtain the gain of information 
I(a,p)=1(a) + 1(f\a) (af = 0), 
and we suppose that this is independent of the order in which the experiments con- 


cerning a and f/f were performed. This assumption means that the unknown function 
f (x) satisfies the functional equation 


(FED f(x) + (1-3) f (4 : ) =fo)+(-y)f (— ) 
—x l-y 
for all pairs (x, y) such that 


(x,y)€ D={(@, y): x,y €[0,1L, @+ y) € J}. 


Now we will derive (FED) from the axioms. Using 3-symmetry, 3-recursivity, 
(10.7), and (10.2), we have for P = (—pj, p2, p3) € T3, 


1 oy) 
H3(p1, p2, p3) = H2(pi + p2, p3) + (pi + p2)H2 ( i ; — 
Pit+p2 pit p2 


=foy el =nF ( 2 ) 


1— p3 
= H3(p2, Pp, P1) 


P2 P3 
= Ho(p2+ p3, pi) + (ra + pata ; ) 
P2+ p3 p2+ p3 


= f(r) + - pif ( Pe). 


Thus we obtain the fundamental equation of information theory (FED) for x,y € 
[0, I[ with x + y € [0, 1], where f : [0,1] > R. 


Now we determine the regular solution of (FEI). 


Solution of (FEI). Suppose f : J — R satisfies (FEI) for x, y € [0, I[ with (x+y) € 
I and is integrable or measurable. Then f is given by 


(10.8) f(x) =cs(x)+ dx forx € J, 


where s(x) is the Shannon function (SF). 
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Note that the integrability or measurability of f implies the continuity of f, 
which in turn implies the differentiability of f (Aczél and Dardéczy [43]), and then 
f has derivatives of all orders. Here we need the derivative of order two only. 

Now differentiate (FEI) with respect to x and then the resulting equation with 
respect to y to get 


y " y _ ” 
G—ap! (.)- ae (+) 


u(1—v) = v(1—u) 
l—uv ? ~~ J=uv) ? 


Now put ;4 =u, Tay = », so thatu,o € 10, 1[, y= 


1—uo 1—uo 
—— f"(u) =0- f"(v) 
i) l-u 
or 
u(1 —u)f”(u) = constant = c, 
and é Z 
f'@=-+ 
u 1- 
Hence 


fu) =c[-ulogu — (1 —u) log — u)] + bu+d 
=cs(u)+ bu+d, 
where b, c, d are constants. This f satisfies (FED) provided d = 0. Thus the solution 
of (FED) is (10.8). 
Continuation of Proof (2) of the Theorem. If f = “2 and since 2 is symmetric, 


f is symmetric; that is, f(x) = f(1 — x) and then 


f(x) = cs(x). 


If we use normality, then c = 1 and f(x) = s(x). Then apply recursivity to obtain 
Un(P). 
This completes the proof of the theorem. 


Remark 10.2. The (FED) has been studied by several authors under various regularity 
conditions (see [43, 13, 193, 202, 129, 530, 477, 603, 806]) such as being integrable 
in I (Tverberg [806]), measurable in JO, 1[ (Lee [603]), monotonically increasing 
in ]O, 5] (Kendall [530]), under continuous at 0 (Daréczy [193]), continuous in 7, 
nonnegative, bounded in J (Daréczy [199]), cag continuity of f(p) = uo2(p,1— 
P), Ber f(p) = 0, and f(p) is monotone in ]0, 5] (Daréczy [193], Borges [129]), 


and inten on an interval or on a set of positive measure (Diderrich [223]). 
Boundedness from one side, like nemo is not enough to characterize the 
Shannon entropy for f (x) = s(x) + —+4, D(x)’, where D is a derivative satisfying 


Te 1) 
(D) (Diderrich [224], Aczél [35]). 
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Problem. Determine all nonnegative solutions of (FEI). 
Now we find the general solution of (FED. 


Theorem 10.3. (Aczél and Daréczy [43]). The most general solution of (FEI) is 
given by 


xL(x)+U—-—x)LUd—x)+bx forx €)0, If, 


(10.8a) fQ)= é forx =Oandx =1, 


where L satisfies the logarithmic equation (L) and b is a constant. 


Proof. Without loss of generality, we can take 
(10.9) f@)=fd--x) forx el. 


Then f is said to be symmetric. 
Indeed, y = 1 — x in (FEI) gives 


fd-x)= f~)+d—2x)fC) forx eT. 
Define 


gx) =f@)+fd—-x-—fQ) forx eI, 
= 2f (x) —2xf (1). 


Then 


gt) +U-2g (-) = 2f (x) — 2xf(I) 
Si [27 (4) = ro] 
by (FE) = 2f(y)+20-y)f (=) — (2x +2y)f(1) 
= g(y)+(1—y)g (=) 


that is, g satisfies (FEI) and is symmetric. 
We can take f (1) = 0. Define 


k(x) = f@)—-— fd)x forx el. 
Then & is a solution of (FEI) with k(1) = 0. With x = 0, (FED shows that f (0) = 0. 


So, when f is symmetric, f(0) = 0 = f(1). Now, for symmetric f : J] > R 
satisfying (FED, define F : Ri. x Ri. > R by 


(10.10) F(x,y)=@+y)f (—) forx,y > 0. 
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Then 


(10.11) F(x, y) = FQ,~x) forx,y >0O (symmetric; use f symmetric), 
(10.lla) F(tx,ty)=tF(x,y) forx, y,t > 0 (homogeneity). 


Further, 
(10.11b)  F@+y,z)+FQ@,y)=FO,y+2)+FQ,2z) forx,y,z>0 
(see also [395]). Using (10.10) and (FED, 


F(x+y,z)+ F(x, y) 


=o+ytos (=) +04+n7() 


x 
x+y x+y x+ytzZ 
=(etytz | — ———_ } + ———_ 
wrrval (se) +a (BB) 
x ytz Sass 
=(x+y+z ———_ } + ———_ f {| ——+ 
oy |e(e) (=) 


=o+ytar(—)+0+04() 


= F(x,y+z)+ F(z, y) 
= F(x,y+z)+F(y,z)_ (use (10.11). 


F on R*_ x R% has an extension F; : R x R > R satisfying (10.11), (10.11a), and 
(10.11b) with F; (x, y) = F(, y) for x, y > 0 (see [43]). For this F;, there exists a 
function # : R — R such that 


(10.12) Fi(u,v) = B(u+v)— flu) — fe) foru,v ER, 
(10.13) Btu, v) = uf(v) + vB(u) u,v ER. 
Since F satisfies (10.11), (10.11a), and (10.11b), put ¢ = 0 and y = 0 in (10.11b) 


to get 
F\ (0,0) = 0, Fi (x, 0) = F\(, z) = Fi (0, 0) = 0. 


Now define operations ® and © on R? by 


(u,x)®(vo,y)=(u+v,x+y+ Filu,o)), 
(u,x)O (v, y) = (uv, uy+ovx), foru,v,x,y ER. 


Then R? with these operations is a commutative ring, and the mapping a : R? > R 
defined by 
atu,x)=u, u,x ER, 


is ahomomorphism onto R. 
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Then there exists a subring S(®, ©) of R?(®, ©) that is isomorphic to R(+, -) 
under the mapping a and a function f : R — R such that every element of S is of 
the form (t, /(t)) for t € R (see [43]). Since S(@, ©) is a subring, we have 


(u, BU) ® (v, Be) = U +0, Blu + v)), 
(u, B(u)) © (0, B(v)) = (uv, Bluv)), u,v ER. 


But using the definition of (6, ©), we obtain (10.12) and (10.13). 
Since F||R4_ x R4 = F, there isa d : Ri. > R such that 


(10.12a) Flu, v) = d(u+v) — d(u) — d(v), 
(10.13a) O(uv) = ud(v) + vd(u), foru,v > 0. 


Define L : Ri. > R by L(u) = -&) for u > 0. Then (10.13a) shows that L is 
logarithmic and (10.12a) yields 


F(lu,v) = —(u+v)L(tu+v)+uL(u)+oL(v), u,v > 0. 


Hence, from (10.10), 


(x+y)f (. —) =-(+y)L@+y)+xL(x)+yLQ). 


: a oy 
With al t,l-t= ae and t € ]0, I[, we see that 


f@) =-L@+y)+ wl) + yl) 


=-La+y)+tLQx)+0-n)L(1-1)@+4+y)) 
=tL(x)+(-t)L0 -t)-tL@+y) 
=ftL(t)+(Ud—-nL0 —-2t), fort € JO, 1[, 


which is to be proved. This proves the theorem. 


Remark 10.4. [43]. The converse also holds. That is, for F : R* x Ri > R 
satisfying (10.11), (10.1 1a), and (10.1 1b), F(5, 5) =1, f : I — R defined by 


F(—x,x) forx € JO, 1[, 
f(x) = 
0 forx =0, x =1, 


satisfies (FEI) and (10.10). 
For the general solution of (FED on open domain ]0, 1[, see Maksa and Ng [619]. 


Remark. If L is measurable, so is f, but not the converse. If (10.8a) holds for L, if 
holds for x L(x) + D(x) also, where D is a derivation. 
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10.2c Some Generalizations of (FED 


Result 10.5. (Rathie and Kannappan [688]). Suppose f, g : 1 > R satisfy 


(10.14) fay tees (Pe -)= fort eos (5 =) 
forx,y € [0, l[ with (x + y) € I and 
(10.15) gxt+y—xy)=gx)g(y) forx,yel. 


The most general solution f of the functional equation (10.14) satisfying f (0) = 
f (1) and ff) = | depending on g, where g satisfies (10.15), is given as follows. 
Tf g is constant on 0, I[, then there arise the following four cases: 
(i) When g is zero in[0,1], f(x) =1forx eT. 
(ii) When g(x) = Oforx € JO, 1], g(O) = 1, then f(x) = 1 in J0, 1[ and f (0) = 
fd) =0. 
(iii) When g is unity in [0, 1], f(x) = 1 on JO, IL and f (0) = fC). 
(iv) When g(x) = 1 forx €[ (0, 1[, gd) =0, f is as in (iii). 
If g is not constant on ]0, \[: 


(v) When g(4) = 5; f is a solution of (FEI). 
(vi) When g(4) x 5; J is given by 


(10.16) f(p)= sip) gp) for pel. 
2g (+) -1 


Proof. We treat case (vi). In this case, g(x) > 0, g(0) = 1, f() = fd) = 
0, f(x) = fC —x), and (10.9) and p = 1 — x and q = 7 in (10.14) yield 
(10.17) 
1 
f(p) +8 — p) f(g) = fea) + sat ( 


Let 


—-) forp € (0,1), q él. 
— pq 


(10.18) G(p,q) = f(p) +[¢(p) + 8 — p)lf(q) forall p,q € 0, I. 


Then, from (10.18), with the help of (10.15), (10.9), (10.17), and g(x) > 0, we 
obtain 


1 = 
Gra) = Fla) + stva)| (=) + $ Fa)| 


1 
= fo + econ) | F (7) +0 [P| po)| 


1- 
1- 
= F(pa+ eva) | f=) 4g {A=} F009] 
Pq 
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1- 1- 
= fea) + eon) | s(=*) +0 { S| pe] 
Pd Pd 
= GQ, Pp). 


Thus, we obtain (10.16). We will connect to Hf later. 
Second Generalization 


Result 10.6. (Kannappan [442]). Let f,h : [0,1[ > R, g,k : I > R satisfy the 
functional equation 


(10.19) f(x)+(—x)%g (4) =h(y) + (1—y)hk (=) 


forx,y € [0, l[ with (x + y) € [0,1], B (© 0) 42. Then f, g,h, k are given by 


g(x) = u(x) — dx? +3, xel, 

h(y) = g) +b10 — yh +p for y € (0, IL, 
f(x) =gd—x) +x? +e.(11—x)¥ +b. forx € 10, If, 
k(x) = g(l—x)+ce1x +da)\(1—x)’ +d forx €]0,1f, 


where b 1, b2, c1, C2, d,, dz are constants and u is a solution of 
i= 
(10.20) u(l—x)+(—x)Pu (4) =u(y)+ Ud —- yeu (=) 
—x = 


forx €]0,1[, y € [0, 11, with (x + y) € JO, II. 


Now we consider the most general generalization of (FEI), 


(10.21) f(x) + M(1 — x)g (4) =h(y)+M(1— y)k (= - -) , 


for all x, y € [0, I[ with x + y € [0, 1], where M is multiplicative. It turns out that 
nontrivial embeddings of the reals in the complex generate some interesting solutions 
(Kannappan and Ng [495]). 

In many applications, various special cases of (10.21) have occurred. The special 
case where f = g =h =k and M = the identity map is known as the fundamental 
equation of information and has been extensively investigated by many authors [43]. 
The case where f = g = h = k and M is multiplicative was treated in [688]. The 
general solution of (10.21) when M(1 — x) = (1 — x)/ was obtained in [442]. With 
these in mind, it is desirable to know the general solution of (10.21). Due to some 
technical difficulties, the case where M is multiplicative but not a power remained 
open. This is resolved by using embeddings of the reals in the complex, giving rise 
to new solutions. 
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10.2.5 General Solution of Equation (10.21) 


Result 10.7. (Kannappan and Ng [495]). Let f,4 : (0, 1[—> R, g,k : I — R, and 
M : [0,1] > R be functions satisfying (10.21), where M satisfies 


(10.22) M(xy) = M(x)M(y) forall x, y €}0, IL 
M(0)=0 and M(1)=1. 
Then they are given by 


f() =@U —x)+a,M(x)+b;M(1—x)+e on (0, 


> 


0, 1 
(10.23) g(x) = d(%) +aaM(x) + b2.M(1—x) +b) on [0, 1], 
h(x) = (x) +aM(x) +b3MU—x) +e — on [0,1], 
k(x) = 60 —x) + aq,M(x) + boM(1 — x) — b3 on [0, 1), 

where 


either M(x) = x? and $(x) = D(x), 
or M(x) = |p(x)? and (x) = alm g(a), 
or M(x) =x and d(x) =xL(x)+ 0 —-x)Ld — x), 
or M is anarbitrary map satisfying (10.22) and ¢ = 0, 
with constants a, aj, bj, c and function D : R > R is a derivation, L : Ry > R is 


logarithmic, and ¢ : R > C is an injective embedding satisfying 


Px+y)=0O)+ 4), Py) = o)(). 
The converse is also true. 


Corollary 10.7a. (Kannappan and Ng [489], Maksa [618], Aczél and Ng [58]). The 
most general measurable solution of 


(10.21a) fa)+-xg (4) =hg)y l= yk (+ ~~). 


forx,y € [0, l[ with (x + y) € I, where f,h : [0,1[ > R, and g,k : I > Rhas 
the form 

f(x) = as(x) + bix +d, 

g(y) = as(y) + boy + bi — ba, 

h(x) = as(x) + b3x + bj + bo — b3 — bg +d, 

k(y) = as(y) + bay + b3 — b2, 
for x € [0, 1. and y € I, where s is the Shannon function and a, b,, b2, b3, b4, and 
d are arbitrary constants. 
Corollary 10.7b. When all f,g,h,k are the same in (10.21a) (that is, when f 
satisfies (10.21a) and is measurable) it is easy to see that 


(10.8) f(x) = as(x) + bx 


for some constants a and b. 
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Corollary 10.7c. (Ebanks, Kannappan, and Ng [238]). Let Jj = ]0, 1[" be the 
n-dimensional open unit interval and C = {(x, y) | x, y,x + y € Io}. Now we give 
the general solution of the functional equation 


(10.21b) f(x) +M(—x)g (4) = h(y) + M( —y)k (=) 


for all (x,y) € C, where f,g,h,k,M : Ip — R are unknown functions with 
M multiplicative, i.e. M(xy) = M(x)M(y) for all x,y € Io. Here, all vector 
operations are to be done componentwise and 1 stands for (1,1,...,1) € R”. The 
special case f = g =h =k is known as the n-dimensional fundamental equation 
of information of multiplicative type. 

The general solution of equation (10.21b) with nonzero multiplicative M is given 
by 


f@) = M@)L@)+ M0 —x)L0 — x) + mM) — mM -— x) +05, 
g(x) = M(x)L@)+M1—x)LU-—x)+mM(x)4+ m, 
h(x) = M@)L(~)+ MU — x)L0 — x) + mM) — naMC — x) +05, 
k(x) = M(x)L(x) + M(L —x)LOU — x) + 93M (x) + 4, 


for additive M; or 


i) =U — x) + Ig —*) + Lae) + 9 + 3; 
g(x) =Lod —x)+Lia)+m, 
h(x) = Lod —x)+ L230 —x)+ Lis) +m +73, 
k(x) = Lol —x)+L3@)+m, 


for M =1; or 


f(x) = —a(x) + 74M (x) — 13M — x) +6, 
g(x) = a(x) + mM) + mM —-x) +93, 
h(x) = a(x) + mM (x) — nsM(1 — x) + 16, 
k(x) = —a(x) + naM(x) + 12M(1 — x) +45, 


for M # 1 and nonadditive, where M and a have one of the following forms: 


Gh Bays ot) =) as a) SE Or some ined & 
(ii) a(G1, 2, ---5 Gn) = nIMd(G), MG, 2, ---, Gn) = |G)? for some fixed k; 


(iii) a(C1, €2,.--5 En) = (Cj —&k), MQ, 2, --- Gn) = C70 for some fixed j # k; 
(iv) a= 0. 


In the above, the L’s are logarithmic and the n’s are constants. The map d 
is a derivation, and Im@ stands for the imaginary part of a nontrivial embedding 


g:R>C. 
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Remark. [442, 495]. The proof involves solving the equations 


m(x)+m(y)=M(1—x)m (4) +M(1—y)m (=) ; 
1-x l-y 
m(x) +m —x)=0, 
M(uv) = MUu)M(v), 
for all x, y,u,v € JO, I[ with x + y € JO, 1[, where m (4 0), M :]O, 1[ — R. The 


solutions are 
M(x) = x and m(x) = D(x), 


or M(x) = |o(x)|* and m(x) = alm¢(x), 
or M is arbitrary and m = 0, 


where D is a derivation and ¢ is an embedding of R into C. 


Before treating the sum form equation (SFE), we give two results connected to 
recursivity. 


(i) Kaminski and Mikusinski [418], from 3-symmetry, 3-recursiveness, and expan- 
sibility and 


Pi P2 
13(P1, P2, P3) = H2(pi + p2, p3) + (pi + p2) 2 ( : —_) 
Pitp2 pit p2 
Pl p2 
= 43(pi+ p2,0, p3) + (pi + pas ( ; 0), 
Pit p2 pit p2 


derived the functional equation 
(10.24) A(x, y,z)= A(x+ y,0,z)+ A(x, y,0), 


called the entropy equation, where H : C = {(x, y,z) € R3 : x, y,z>0, xy+ 
yz+zx > 0} — Ris defined by 


x y Zz 
A(x,y,z)=(a+y+z oS. oo oo is 
syne) y uo (—— Xt+y+zZ — 


Evidently, H (Ax, Ay, Az) = AH (x, y,z), 4 > 0 (A is positively homogeneous 
of degree 1). 

It is proved [418] that if H is continuous, symmetric, and positively homoge- 
neous of degree 1 on C and satisfies on the interior C° of C the functional equa- 
tion (10.24), then 


H (x,y,z) = (x+y +z) log@ + y +z) —xlogx — ylog y — zlogz. 


A simpler proof connected to (FEI) is given in [17]. Also, the general solution of 
(10.24) is given by 


At,y,2a=@+yt+zLat+yt+z)—xL(@) -— yL(y) -— zL(z), 


where H defined on C is positively homogeneous of degree 1, satisfies (10.24), 
and is symmetric, and L is logarithmic. 
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(ii) Instead of the recursivity (10.7), assume the general version 


(10.7a) Mn (Pi, P2,-+-5 Pn) = Un—-1(P1 + P2, P3,+++s Pn) + Pn—-1(P1, P2)s 


where $,— 1 is a function of (pj, p2) defined on S = {(x, y): x,y > 0, 0 < 
x+y < 1}. Since this axiom is weaker than (10.7), it is necessary to supplement 
it with additional axioms. As a matter of fact, we prove the following theorem. 


Theorem 10.8. Suppose ja is symmetric, Un is expansible, (2,n) is strong additive, 
(10.7a) is recursive with $2 continuous on S, and 25, 5) = 1. Then uy,(P) = 
H,(P) for P € VT, with pj + p2 > 0. 


Proof. It is enough to show that 


y 
xty' x+y 


Pn—1(%, y) = (x + aA ) for (x, y) € S. 


Then the result will follow from Theorem 10.1. 
As the first step, we will show that ¢,— is independent of n and ¢,—1(p1, p2) = 
3 


$2(P1, p2). For (pi, p2) € S, choose p3 such that > pi = 1. Then, using expansi- 
i=1 
bility, we get 
Pn—1(P1, P2) = Hn(P1, P2, P3,0,...,0) — Un-1(P1 + p2, p3,0,...,0) 
= 43(P1, P2, 3) — H2(P1 + Pa; P3) 
= $2(P1, p2). 
Next we prove that “,, is n-symmetric by induction. Since 4 is symmetric, so are 


2 and 3 (use expansibility). Thus ¢2 is symmetric. Suppose “,—1 is symmetric 
forn > 3. Evidently, “7, is symmetric in pj, p2 by (10.7a). Further, from 


Hn(P1, P2> +++» Pn) = Hn—-1(P1 + P2, P3, +++» Pn) + $2(P1, P2)s 
by the induction hypothesis, wy is symmetric in p; (i = 3,4,...,2). Now fy will 
be symmetric in p2, p3 provided 
Hn—1(P1it P2s P3, +++» Pn) +O2(P15 P2) = Mn—-1(P1+ P3, P25 +++ Pn)+$2(P1, P3) 
holds; that is, if 


f2(p1 + p2, p3) + $2(p1, p2) = d2(p1 + ps, p2) + $2(P1, Ps) 


holds, which indeed is true from 74. Hence “1, is symmetric. Now we will show that 
$2 is additive. From n-symmetry, (10.7a), and (2, 4), (2, 3) additive, we have from 
Lg(pip, p2U — p),..-, pal — p)) that 


4 


(10.25) > ¢2(vip, pill — p)) = u2(p,1— p) for pe I, 
i=1 
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and from “6((pi + p2)P, (pi + p2)(1 — p), p3p,---» pal — p)) 


4 


go((pi + p2)p, (pi + p2)(1 — p)) + >. d2(pip, pil — p)) = 2p, 1— p). 
i=3 


Thus 
f2(pi + p2)p, (pi + p2) — p) = d2(pip, pid — p)) + ¢2(p2p, p20 — p)), 


which is additive in the sense that if, for fixed p, we define Ap(x) = ¢2(xp, x1 — 
P)), forx eT, 


Ap(pit p2) = Ap(pi) + Ap(p2) for pi, p2, pi + p2 € I. 
Since ¢2 is continuous, so is Ap, and from Theorems 1.1 and 1.2 it follows that 
p2(xp,x(1 — p)) = xAp(1) = xh2(p, 1 — p). 


But then (10.25) gives ¢o(p, 1 — p) = u2(p,1— p). 
Set xp = p1 and x(1 — p) = pz to get 


P1 P2 
P1 + p2’ Pit p2 


f2(P1, p2) = (pit p2)H2 ( ) » (p11, p2)€S. 


This proves the result (see also [196, 204, 13]). 


Now we turn our attention to the other equation, the sum form equation (SFE), 
and connect it to (FEI). 


10.2d Sum Form Functional Equation (SFE) and 
Its Generalizations 


Ever since Chaundy and McLeod [149] considered (SFE), which arose in statistical 
thermodynamics, it has been extensively studied (see [41, 204, 432, 448]). First we 
will obtain the continuous solution of (SFE) and then its measurable solution holding 
for the fixed pair of integers m,n (> 3). 


10.2d.1 Representation and Characterization 


A sequence of measures “ : A, — R is said to have the sum property if there exists 
a function f : J > R such that 


(10.26) Un(P) = >> f (pi) 
i=l 


for P € An (n => 2). In this case, f is said to be a generating function of {Un}. 
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Theorem 10.9. (Kannappan [432]). Suppose un : Tn — R is (m,n)-additive and 
has representation (10.26) holding for all m,n (= 2) with continuous generating 
function f. Then f has the form 


(10.27) f@)=cxlogx forx el, 


and [n(P) = cH),(P) for P € Tn. 


Proof. (See also Aczél and Daréczy [43] and Chaundy and McLeod [149].) From 
(m, n)-additive and the sum representation (10.26), we obtain the (SFE) for (x;) € 
Vn, (vj) € Pm. Now we describe all the continuous solutions of (SFE) by giving a 
simple proof. 

Define a function 


1 
(10.28) h(x) =xf (<) for all real x > 1. 
x 


Evidently, 4 is continuous. Let m, n be any integers > 1. Then putting x; = 1/n @ = 
1,2,...,n) and yj; = 1/m (j = 1,2,...,m) in (SFE) and using (10.28), we get 


(10.29) h(mn) =h(m)+h(n) _ for positive integers m,n > 1. 


It is known that h in (10.29) has a unique extension to the multiplicative group of 
positive rational numbers. The question is whether this extension is the same as / in 
(10.28). That it is indeed the case is shown as follows. 
Taking any rational r € ]0,1[ as m/n (m < n) and letting xj = m/n, x2 = 
= Xn—-m41 = 1/n, and yj = --+ = ym = 1/m in (SFE), we obtain (by taking n 
asn —m+ landmasn) 


i 1 wn 1 1 
(10.30) mf (-) + (n—m)mf (—) =f (=) +(n—m)f (-) +mf (-) 
n mn n n m 
Using (10.28), (10.29), and (10.30), we find that 
(10.31) h (-) =h(n)—h(m) form <n. 
m 


From (10.29) and (10.31) there results that 
(10.32) h(xy) =h(x)+h(y) forall rationals x, y > 1. 


The continuity of 4 implies that (10.32) holds for all real x, y > 1. It follows that 
h is logarithmic and h(x) = —clog.x for some real constant c. This together with 
(10.28) gives the required solution f of (SFE). From the representation (10.26) 
follows u,(P) = cH,(P). This proves the theorem. 


Remark. The solution (10.27) of (SFE) is obtained in [41] holding form = n (> 2), 
in Daréczy and Maksa [204] form = 2, n > 3, and in [43] form = 2, n = 3. Now 
we obtain the solution of (SFE) holding for fixed m,n, showing that the solution 
depends on m and n also. 
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Now we consider the measurable solution of (SFE) holding for a fixed pair 
m,n (> 3). 
By letting 


(10.33) F(x, y) = f(xy) — yf(x) — xf) 


for x, y € [0, 1], we see that F is measurable in each variable and that (SFE) takes 


the form 
n m 


> > Gy) =0 


i=1 j=1 


for (x;) € Tn, (vj) € Vm, holding for some fixed pair m,n (> 3). Then F is given 
by [448] 


F(x, y) = F(O, y)\( — nx) + F@,0)0 — my) — FO, 0)(1 — nx) — my) 
and here 


= fO)A—y)A —nx) + FO) — x) — my) — FO) — nx) — my). 


Thus we obtain 


f(xy) = yf ®) + xf0) + FMA — y)d —nx) + FO)A — x) — my) 
f(0)U — nx). — my) 


for x, y € [0, 1], which can be rewritten as 
fy) = yf a) +xf(y) +d[1-—x -—y+(m+n-mn)xy], 
where d = f (0); that is, 
f(xy) —d —d(m+n—mn)xy = x(f(y) — 4) + y(f@) - 4) 
or 
(10.34) (xy) =xk(y) + yk(x) forx, y € [0, 1], 


where 
h(x) = f(x) -—dd+m+n—mn)x, k(x) = f(x) -d, 


are measurable functions. 
Now (10.34) can be rewritten as 


hoy) _ RO) KU) 
xy y , 


x,y €]0, 1], 


a Pexider equation, so that the measurability of the functions implies 


k(x) = bxlogx+cx, x €]0,1), 
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where b, c are constants; that is, 
(10.35) f(x) =bxlogx+cx+d, x €]0,1). 


But obviously (10.35) holds for x = 0 in view of f(0) = d and the fact that 
OlogO = 0. 

Now, f given by (10.35) is a solution of (SFE) provided c = (mn — m —n)d. 
Thus we have proved the following theorem. 


Theorem 10.10. (Kannappan [432]). Let f : [0,1] — R be measurable and satisfy 
(SFE) for a fixed pair m,n (= 3). Then f is a solution of (SFE) if and only if 


(10.27a) f(x) = bx logx + d(mn —m —n)x +d, 
where b, d are constants. 


Remark 10.11. We will show the close connection between (SFE) and (FEI). As a 
first step, we prove the following lemma. 


Lemma 10.11a. Let fj : 1 > R G@ = 1, 2,3, 4) satisfy 
(10.36) fi@y) — fl —x)y) — fa) = yfa) forx,y eT. 
Then k(x) = fa(x) — (f4(0) — faC)x — fa(0)) is a solution of (FEI). 
Proof. Set x = 0 and x = | separately to obtain 

fi) — fay) — AY) = yfaO), 

fi(y) — f20) — AO) = yfaQ). 


Substitution of {3(y) and f2(y) from these equations in terms of f{(y) in (10.36) 
yields 


(10.37) f(xy) + f(A —x)y) -— FO) = yk(x) forx,y € J, 


where f(x) = fi(x) — f(O) and k(x) is as given above. 
With 


g 
(> )- SF) e( —) 
oo y Pons): 
from (10.37) (using y = 4 and y = 45%) we get 
oy + =a (PL) = lr (3) +4()-+(5)] 
tale@)+s (=) -1()] 
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Since the right side is symmetric in x and y, k satisfies (FEI). This proves the 
lemma. 


Now we obtain (FEI) from (SFE). Suppose f : [0, 1] — R is measurable and is 
a solution of (SFE) holding form = 2, n =3 with f(0) =0= f(1). 
For p € I, take x1 = p, x2 = | — p, so that (SFE) can be written as 


3 
(10.38) S°Lf(py;) +f — p)yjl = f(P) + FA — p) + FO + £02) + £3). 


j=l 


For fixed p, define A(y) = f(py)+ f(Ud-p)y)-fm—-y\FP)+fU-p)), y € I. 
Note that A(O) = 0. Then (10.38) becomes A(y;) + A(y2) + A(y3) = 0, which is 


an “additive” equation on J. Since f and thus A is measurable, A(x) = c(p)x for 
x € I. Since A(1) = 0 = c(p), we have 


0= A(y) = f(py) + f(d — p)y) — fQ) = y(f(p) + fd — p)), 


which is (10.37) with k(x) = f(x) + f(1—~x). Hence f(x) + f (1 — x) is a solution 
of (FEI). 

With some manipulation from f(x) + f(1 — x) = cs(x) and (10.37), we can 
obtain f(x) = cx log x. 

In Daréczy [194], the measurable solution of (SFE) holding form = n = 2 is 
obtained. 


Result 10.12. Let f :]0,1[ — R be a measurable solution of 


f(pqay + f(A = p)g) + fp — 9) + FIG — p)\d —9)I 


10.39 
nen =f) a Fl=p+iorsd=H 


for p,q € ]0,1[. Then f is bounded and integrable in each closed interval and 
infinitely differentiable (it need only be six times differentiable) and has the form 


(10.40) f@Mm= b(4x3 = Oy? & 5x) +cxlogx+d, x €]0, I[, 


where b,c, d are constants. 
The measures [Ln having the sum property with a measurable generating function 


f ave (2, 2)-additive if and only if 


Un(P) = 4bH}(P) —9bH2(P) +cHn(P)+dn for P € 1°. 


Some Generalizations 


Result 10.13. (Kannappan and Ng [496]). The measurable solutions fj : \0, 1[ > 
R (@ = | to 6) of the functional equation 


(10.39a) fi(pq)+ fo(pU—q))+f3(— p)q)+ fa — p)\U—4)) = fs(p)+ f(a) 
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for p,q €\0, lL are given by 
fi(p) = 4ap’ + (b -9a)p? + cip +cplogp +ejlogp+di, i=1,4, 
fi(p) = 4ap’ — (b+ 9a)p? + cip +cplogp+eilogp+di, i =2,3, 
fs(p) = —6ap* + (6a — 2b + co — c4)p + cp log p + (1 — p) log(1 — p)) 
+ (e1 + €2) log p + (e3 + e4) log(1 — p) +ds, 
fo(p) = —6ap* + (6a — 2b + c3 — c4)p + cL plog p + (1 — p) log(1 — p)] 
+ (e1 + €3) log p + (e2 + e4) log(1 — p) + do, 
where a,b,c, cj, dj, e; are constants with 
4b+c,—c2-c3+c4=0 and b—S5a+c44+d,+d2+d3+d4=d5+ do. 


This solution is obtained by determining the measurable solutions of 


(10.39b) f(pq) + fp —9g)) + f(A — pig) + f(U — p)d — q)) = 0 


and 


(10.39c) — fi(pq) + fate — 9) + fa. — p)g) + fal — p)U— 9) = 0 
as f (p) = 4bp — b and 
Be = bp? +(a—b)p—clogp+d;, i= 1,4, 
fi(p) = —bp? + (a+b)p+clogp+d;, i=2,3, 
where a, b,c, d;’s are constants with a + dj + d2 + d3+d4=0. 
Problem. Determine the general solutions of (SFE), (10.39), and (10.39a). 


Result 10.14. (Kannappan [448, 436], Kannappan and Sahoo [501]). Suppose g : 
I — Ris measurable. Then g is a solution of the equation 


m n m n 


0.41) SoS giv) = Digi) + >> 80) +e: >) 8G): >> gy) 


fi je) i=1 j=l 


holding for fixed m,n = 3 (for (xi) € Vm, (yj) € In), ¢ 4 0, if and only if g has 
either the form 


(a-1 b 
(10.42) g(p) = eee aa 
iC 6 
or 
p — p 
(10.42a) ep) =2—*, forpel, 


where P is an arbitrary constant, while the constants a, b satisfy the equation 


(a + mnb) = (a+nb)(a+mb). 


430 10 Functional Equations from Information Theory 


Remark. ps b = 0 = g(0) (then a = 0), and g(1) ¥ O, then (10.42) yields 
g(x) = —# (a solution independent of m and n). Note that c = 0 gives (SFE). 
Now we consider the functional equation 


n m 


(10.43) >> fia) = di 21a) + Lay LSD, 


i=1 j=1 
where P € T,, Q € Tm, a, 8 (F 0), f : J0,1[ — R, and obtain the following 
result [436, 502, 443]. 


Result 10.15. Let f : ]0,1[ — R be measurable. Then f satisfies the functional 
equation (10.43) for all P € TY, and Q € Ty with arbitrary but fixed m,n (= 3) if, 
and only if, 


a(p* — p*), a # B, 
f(p) = japlogp+bimn—m—n)p+b, a=fp=1, 
ap* log p, a=f (41), 


where a, b are arbitrary constants. 


Result 10.15a. (Kannappan [456]). The most general solution of the functional 
equation (10.43) holding for all P € Tn, Q € Tm with an arbitrary but fixed pair of 
integers m,n greater than 2 is 


f(y) = A(p) + d(p* — p®), a# B, 
A(p)+ L(p)p*, a=fFl, 


where A is additive with A(1) = 0 and L : 0, 1[ > R is logarithmic. 
More Generalizations 


Result 10.16. (Kannappan [452], Ng [649]). Let fij, gi,hj :1 > R@=1,2; j= 
1,2,...,n) be measurable and satisfy the equation 


(10.44) Sore = Ye (pi) + Shia 


i=) fet 
for fixed n = 3. Then fij, hi, gj are given by 
(10.44a) 
fij(x) = ax logx + (b+ dj)x +d1;, 
f2oj (x) = ax log x + (dj +c)x +d2j — d1;, 
hj(x) = ax logx + djx + dz; — cj, FH gn 
gi(x) = g(x), arbitrary, 
go(x) = —g(1 — x) + alxlogx 
+( —x)logd —x)]+ (6-—c)x+c+d, 


where a,b, c,d, Cj,dj;,d\;, and d2; are arbitrary constants. 


10.2 Notation, Basic Notions, and Preliminaries 431 


Next we treat the functional equation 


(10.45) > Ga 7 Dela Da mA (pi) 


i=1 j=1 


for P € Im, Q € Tn, fij,hi, gj : 1 — IR measurable and obtain the following 
result. 


Result 10.17. [443]. Suppose (10.45) holds form = 2, n = 3. Then they are given 


by 
fij(x) = ax logx + (b+ dj)x +d1;, J = 1,2,3, 
f2oj (x) = ax logx + (dj + C)x + d2;, JH 1,233; 
g3(x) = ax logx + (dj —d)x + d\j + dz; — bj, for j =1,2,3, 
h(x) =h(x), 
ho(x) = —h(1 — x) + al[xlogx 

(10.45a) + —x)logd —x)])-—(b-C)x+b, forf=1, 
fig) = @t b)x8 + djx + dij, PS 233, 
foj(x) = @t C)x8 + djx +do;, j= 1,2,3, 
gj(x) = ax? + (dj —d)x +d\j + do; — bj, (212.2. 


hy (x) + h(x) = a[x4 + (1 —x)4 - 1] 
+bx? + CA — x), 
where a,b,c,d,dj,dij,bj (i = 1,2, j = 1,2, 3) are constants with b3 +b) +b2+ 
d=0. 


Remark. Here again we connect the “sum form” equation and “fundamental” equa- 
tion. The proof uses the (measurable) solution of the equation 


(10.36a) f(xy) — g(1—x)y) —h(y) = yPk(x), x, ye, B £0, 


where f, g,,k : I > R (see (10.36)). If we define kj (x) = k(x) —cQ — xic= 
g(0), then k; satisfies [456] 


k(x) + —x)Pky (4) = ky(y) + (1 — yy (=) 


and its solutions are 


ee a[xlogx + (1 —x)log(dl—x)]+ bx forf=1, 
Pee Veh aol a a Bue for B £1. 


Another generalization of (SFE) is the equation [443, 436, 516] 


n m m m 


(10.46) YL ivteay= Dot Dasitai) + Dea A (pi). 


7=1 j=1 


where PET,, OE Tm, anda, f (40) ER. 


432 10 Functional Equations from Information Theory 


Result 10.18. (Kannappan [456]). Let fij,gj,hi : 1 — R be measurable i = 
1,2,...,n, j = 1,2,...,m). Then these functions satisfy the functional equation 
(10.46) for fixed m,n (= 3) if and only if they are of the form given by 


fij(*) = ajx® + bix? + dijx + Cij, 
(10.46a) gj(x) = ajx® — CxP + djx + ej, fora # B, 
hji(x) = Cx*® + bjx? + ajx + Vis 
i= 1,2,...,n, j = 1,2,...,m, where aj, bj, C, dij, cij,dj,ej, Gi, yi are con- 
stants satisfying some conditions, or are of the form given by 
fij &) = ax® logx + (bj + Ci)x® + dijx + cij, 
(10.46b) gj(x) =ax* logx + bjx* +djx + e;, fora =f, 
hj(x) = ax logx + Cix® + ajx + Bi, 


fori =1,2,...,n, j =1,2,...,n, where the constants a, bj, Cj, dij, Cij, dj, ej, Gis Yi 
satisfy some conditions. 


10.2e Other Measures of Information—Entropy of Type /, Hf 


In Havrda and Charvat [361], the entropy of type # (10.3) HE (P), fb # 1 was 
introduced (see also Daréczy and Kiesewetter [202] and Kannappan [448]). 


(i) Suppose “iy : T, — R is 3-symmetric and n-recursive of type /; that is, 


Ln(P1, P2,+++5 Pn) =Un-1(P1 + P2, P3,--+5 Pn) 


Pi P2 
+ (pi + p2)? 2/ ; ). 
pee Pit p2 Pit p2 


with B #1, pit p2 > 0, w2(4,4) = 1. Then p»(P) = HP (P). From 
3-symmetry and 3-recursivity of type / there results the functional equation 


f@)+d = (Fo -)= fo)+a -»t (; ). 
where f : J] > Ris defined by f(x) = w2(x,1—~x), which is a special case of 
(10.14), and the solutions are given by (10.16), f(x) = ai 1 Gey @ =x) = 


1) = sg(x). Then recursivity yields u,(P) = Hf (P). Note that no regularity 
condition on f is needed. 
(ii) Hf is not additive but nonadditive, satisfies 


(10.47) HE (P * QO) = HA(P) + HA(Q) + cHf(P)HA(Q), 


for P € T,, Q € Im, and has representation with f(p) = c(p’ — p), where 
c=2!-8 -1, 
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n 
Suppose #, : [, — R has representation “n(P) = >) f(pi) and satisfies 
i=1 


(10.47). Then f satisfies (10.41) and is given by (10.42a), f(x) = 4(xF —x), xel. 
Thus w2(P) = HP (P). 


10.2f Directed Divergence (dd) and Inaccuracy (KI) 


The measures of information known as directed divergence (dd) (Kannappan and 
Sahoo [510], Kannappan and Ng [491]) and inaccuracy (Kerridge [532]) are given 
by D, and I, respectively. There are many algebraic properties satisfied by D, and 
I,, such as symmetry, recursivity, and additivity, among others. 


Postulates a 


Let un : T 2 — R (a = 2) be a sequence of functions satisfying the following 
postulates. 


(i) Symmetry. “,(P||Q) is symmetric. 
(ii) Branching. 


Ll Pis+++>5 Pn =p 1 Pit P2, P3,--+5Pn 
2 15+-++54n _ q1 + 42, 935+++54n 
P1 p2 


+ (or tpnia( PE PEP), Pit p2,91+492>0, 
qitq2’ girq2 


with 


(iii) F(p,q) = K2 (’ 7 4 measurable for (p,q) € J, where 


J = ]0, 1[ x ]0, 1[ Uf, y)} U {G, z)} for y € JO, 1[, z € JO, 1]. 
Then 
(10.48) F(x, y) = as(x)+ d[x logy +  — x) log(1 — y)], 
where a and d are arbitrary constants. 


From (i), (ii), and (iii) can be derived the functional equation 


Fes.) +r (74) 
(10.49) 
=F(tu,v)+U—-u)F (=. =) for (x,y) € J, 


a two-variable version of (FEI). 
Now we obtain the most general solution of (10.49) [442, 450, 491]. Let F : 


J — R satisfy the functional equation (10.49) for x, uw € [0, 1[ with x + u € J, for 
y,v, y +0 € Ig, where J = 7 x In U {(0, 0), CJ, 1)}. 
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Without loss of generality, we can assume F is symmetric; that is, 
Fax,y)=FU-x,l-—y) for(@,y)e VJ. 


The function H (x, y) = F(x, y) — F(., 1)x also satisfies (10.49) and that H is 
symmetric. 

So, let F' satisfy (10.49) and be symmetric. For fixed v, y € ]0, I[, (10.49) can 
be rewritten as 


(10.21a) g(x) +(1—x)h (+) = k(u) + (1—u)N (—) 
1-x l-u 


(which is a special case of (10.21)) for x, u € [0, 1[ with (* + u) € I, where 


g(x) = F(x, y), A(x) = F(x. a k(x) = F(x, v), N(x) = F(x. =). 
=y¥ =p 


1 
As in (10.22), for (x, y) € D, we have 
g(x) = s_~(x) +a1x + a2 forx € 7, 
h(x) = six) + bix + bg for x € Ip, 
k(x) = sx(x) + 1x +b2 +1 — a1 — a2 for x é (0, 1[, 
N(x) = sz (x) + (b1 +.a2)x —c1 +a; +a. forx € Io, 


where s; (x) is given by (10.16) and a; and b; are functions of v and y. Thus F is of 
the form 


(10.50) F(x,y) =xL(x)+ Ud —x)L0 —x)+aQy)x + d0), 


for x, y € Io, where a and b are functions of y only (use the forms of g and k so that 
a and b are functions of y only). 
First, using the symmetry of F,, we have 


a(y)+aQ_—y)=0 
and 
(10.51) a(y) +b(y) =bU—y), fory € Io. 


Substituting this value of F into (10.49) and equating the coefficient of x and the 
constant terms, we obtain 


(10.52) ay) -0() =a 2 ) 
l-y l-o 


and 


(10.53) b(y) +b () = b(v) +b (=) 
l-y 1-ovb 


fory,vo,y+ov elo. 
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Use (10.51), (10.52), and (10.53) to get 


bl —y) = bo) +6(1- : ): 
i) 


that is, b satisfies the Cauchy functional equation (logarithmic) 
b(— pq)=bU— p)+bU—q) for p,q € Io. 


Taking (x) = b(. — x) for x € Io, it follows that € is logarithmic and from (10.51) 
that 


a(y) = €(y)—-€U—y). 
Thus, F is of the form 


(10.50a) F(x,y) =xL(x)+(—x)LUd —x)+x€0) +d —x)€(— y) 


for x, y = Io, where L, ¢ satisfy (L). 
Substitutions x =O0= y; u=1—x,v=1-—y; x =0; u=1—~x in (1049) 
yield 
F(0,0) =0= F(,1); FO,y)=€U—y); Fd, y) = €(y). 


Thus (10.50a) holds for (x, y) € J. 


Remark. If F : J — R satisfies the functional equation (10.49) only (no symmetry), 
then the most general form of F is given by 


F(x, y) =xL (x) + (1 —x)LU — x) + x€() + C—)E( — y) $x, 


where c is an arbitrary constant and L, ¢ are solutions of (L). This proves the con- 
jecture in Kannappan [478]. 


The measurable solutions of (10.49) are given by (10.48) (which follows from 
(10.50a)). 
To obtain directed divergence D,,, use 


vat. 4 (2/319) = 4, 
natty. 2 (1/2172) —o 
so thata = 1 = —d and 
F(x,y) = xlog~ +(1 ie 
y (l—y) 


Mn(P||Q) = Dn(P||Q). 
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To obtain inaccuracy J, use 


3 1 0 
Unit. u2 1/2 i) = 1, 


i21P).. 
Me Te ee 


so thata = 0, d= —1, 
F(x, y) = —x logy — (1 — x) log(1 — y), 
Ln(P||Q) = In (P||Q). 


Postulates b 
Let Ln: Beg — R be a sequence of functions satisfying the following properties: 


e@ pn(P||Q) is symmetric. 

e (m,n)-additive. Umn(P * Q||R * S) = uMm(P|R) + “n(QI|S), PLR € 
Tm, Q,S € Vn, forallm,n > 2. 

e Representation-sum property. There exists a continuous (or measurable in 
each variable) function g : [7 — R such that 


(10.54) Hn(PIQ) = > e(pi.gi), P,QETn 
i=1 


(f is called a generating function of the sequence {,,}). Then g has the form 
(10.54a) g(x,y) =axlogx + bxlogy+cylogy +ajx +a.y+d 

for (x, y) € J with aj + a2 = (mn —m — n)d. 

From the (m,n)-additivity and the sum representation there results the “sum 


form” equation 


m n m n 


(10.55) > di eas. 715/) = > epi.) + D8 @j,5)): 


i=1 j=l i=1 j=l 
whose measurable solution is given by (10.54) (see Kannappan [452, 443]). 


To illustrate the link between the “sum form” (10.55) and the “fundamental equa- 
tion” (10.49), consider the equation (10.55) holding forn = 2, m = 3, 


3 3 
>) 8(p-rj.9°5;)+ > 8(— p)rj, 0 — 9)s;) 
(10.55a) 7 = 
= g(p.q)+ 80 —p,1—q) + >. 8(rj,5;), 
j=l 


for p,q € I, R, S € T3. Then g has the form (10.54a) with a; +b; = d [452, 494]. 
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G(r, 5) := g(pr,qs) + g(U — p)r, A — 4)s) 


(10.56) 
—rilg(p,q)+gd—p,l1—q)]—2e@s) 


for (r,s) € J =I x In U{(O, 0), C1, 1)}. Then G satisfies 2 G(r;, 5;) = c, and by 


[452], G(r, s) =ar+bs+dwitha+b+3d=0, wei a, 'b, and d are functions 
of p and q. First, let r = 0 = s to get g(0,0) = d and thenr = 0 to obtain 


g(0, qs) + 80, U — q)s) — 80,5) = bs +d = bq) +4, 


which is of the form (10.36) [452]. Hence, from [452] we have b(q) = —as(q). 
Then (10.56) becomes 

(10.57) 

K (pr, qs) + K(. — p)r, —9)s) — KG, s) = r[K(, q)+K0— p,1—9)—4], 


where 
K(r, 5) = g(r, s) —ajslogs—d_ for(r,s) € J. 
Define 
(10.58) F(x, y)=K@,y)+K0 =%,1=y)=d, Gye. 
Then 


Fis.) + = a)F (Te) 
= K(x,y)+KU-x,1l-—y)-d 


i=3)| Kb = a= a ag 
1-—x 1-x 1-x l-y 


=K(x,y)+K(1—x,1-—y)—-d+KU,0d) 
+K(Q-u-—x,l-—v-—y)—K(U—-x,1-,). 


Take r = 1 — x, s = 1 — y in (10.57), which is symmetric in x and uw and y and v. 
Hence F satisfies (10.49). 


This shows the connection between these two types of functional equations. Note 
that F is measurable in each variable and is symmetric, and F is given by (10.48). 
From this F and (10.57) follows K (x, y) = —a2x log x —a3x log y—a4x —asy — a6, 
and g is given by (10.54). Then, from the representation, we get “,(P||Q). For 
various characterizations of directed divergence, see Kannappan and Rathie [497] 
and Kannappan [478, 444]. 


10.2f.1 Generalized Directed Divergence 


Now we consider generalized directed divergence, GD, briefly (Ng [649], Aczél 
and Nath [56]). 
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Let un: r — R (n = 2) bea sequence of measures. 


(i) As before, using the properties of 3-symmetry, n-recursivity, or branching with 


measurable 
P, 1— P 
A(p,g,r)=H2)9,1-¢], for(p,g,reK, 
r,l—r 
where 


K =)0, 1[ x ]0, I[ x ]O, 1[ U{@, y, z)} U{G, y’, 2D}, 
for y,z € [0, 1[, y’,z’ € ]O, I[, we obtain the “fundamental” equation 


u 0) w 
A(,),2) + (=x)0 | —, ——, —— 
l-x l-y 1l-z 
(10.59) . 
= H(u,v, w) + (1—wH (——, ~~, *_ 
l—-u l—-v l-w 

for (x, y,z) € K. The general solution of (10.59) is given by [450] 


A(x, y,2) =si(x) +x€i(y) + Ud — x)€idl — y) + x€2(z) 


10.59a 
eee +(l-x)f2(1-2), for(x,y,2€ K, 


where L, £1, £2 are logarithmic, s7(x) is given by (10.16), and the measurable 
solution is given by [490] 


H (x,y,z) =a(x logx + (1 — x) log — x)) 
(10.59b) + b(x log y + (1 — x) log(1 — y)) 
+c(«logz + (1 — x) logd — z)). 


To get the form GD,, use nullity H(3, 5; 5) =0= H(3, 7 x). so that 


— 
(1 —z) 


H(x,y,z)=6 | rtos + (1 —x)log 
Zz 


and un(P||Q||R) = GD, (P|| Q||R) (from recursivity). 

(ii) There is a real-valued continuous (measurable or measurable in each variable) 
F : K + Rwhere K = 13 U{(0, y, z)}U{(@, y’, z’)} with y,z € [0, I[, y’,z’ € 
JO, 1] such that the following hold: 


nN 
e Representability. “,(P||Q\|R) = >° F(pi, qi, ri), for all n > 2. 


i= 
e (m,n)-Additivity. mn(P*U ||Q*V ||ReW) = un(PI|Q|R)+um (UV IW) 
for P,Q,RET,n, U,V,W ET yn. 
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These two properties lead to the “sum form” equation 
n m n m 
(10.60) 2) >) F(piuj,qivj.riwj) = >° F(pi.gi.ri) + > F(uj,0j, vj), 
i=1 j=1 i=l Fl 
P,Q,R € Tn, U,V,W € Im, (pi. gi,ri) € K, whose measurable solution is 
given by 
F(x, y,2) = s(x, y,z) + dix + doy + d3z + dg 
with d, + dz + d3 = (mn — m — n)da, where s(x, y, z) is given by (10.1c). 
Remark. As before, we can link the “sum form” (10.60) to the fundamental equation 
(10.59) (see Kannappan [455]). 


For various characterizations of generalized directed divergence, see Aczél and 
Nath [56], and Kannappan [437, 444, 455]. 


Remark. The measures H,,, [,, Dn, GDn, HE, H,(P; W) possess in particular the 
properties of symmetry, being expansible, and branching. It is shown in Ng [650] 
that these three properties are equivalent to the representability of H,, I,, Dn, GDn, 


etc. All these measures H,, I,, Dn, HE, Dy(P\|Q) + Dn(Q||P) (called symmetric 
difference) possess the sum property with regular generating functions. With the 
exception of entropy of degree /, all are (m, n)-additive for all m,n > 2. 


We now intend to find all measures uw, generated by Lebesgue measurable func- 
tions that are (2, 2)-additive. This amounts to solving the functional equation 


f(pr, qs) + f(pd —r),qU —s)) + f(A — p)r, d — @)s) 
+71 =p) —7),.1=—@)l =) 
=p.) fO=—pla=g+ sie slani= 


for f : JO, 1] x JO, 1[ — R, measurable in each variable, which in turn leads to the 
following (representation) result (see [494, 507]). 


Result 10.19. The measures Un having the sum property with a Lebesgue measur- 
ablegenerating function f are (2, 2)-additive if, and only if, they are given by 
Hn(P||Q) = 4aH, (P) + 4a’, (Q) — 9a? (P) — 9a' He (OQ) 
+ bH)(P) + b'H,(Q) +cl,(P||Q) + c'In(Q\| P) +dn, 
where a,a',b,b’,c,c', and d are arbitrary constants. 
Furthermore, it is easy to see from the result above that if the measures “, are 
(m, n)-additive for all m,n > 2, then a = a’ =d = 0, and that wu, has the form 


(10.61) Mn(P||Q) = bHn(P) + b'Hn(Q) + cIn(P||Q) + In(QIIP). 


In what follows, we intend to examine additional distinct properties that distin- 
guish various measures of information that are of the form (10.61) and isolate them 
as special cases. It is possible to obtain characterizations of various known measures 
by giving suitable postulates and using methods adapted in [494]. 
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e Shannon’s entropy. In the event that 4, depends only on P or only on Q, then 
Ln is of the form bH,(P) or b'H,(Q) accordingly. 

e Inaccuracy. We utilize two of the properties of inaccuracy /, cited in [532] to 
isolate inaccuracy out of the measures (10.61). 
“If two sets of alternatives are asserted to have probabilities which are indepen- 
dent, the inaccuracy of the joint assertion is the sum of the separate inaccuracies.” 
This motivates the following stronger version of the additivity. 

e Strong additivity. A measure /,, is said to be strongly additive provided 


Hmn((P1, P2)||Q1 * Q2) = un(Pi|]Q1) + un(PallQ2), 


where the joint distribution (not necessarily independent) 


(Pi, Px) = (Pity -++s Pins P21s +++ P2ns ++ +5 Dmls+++s Pmn) € To, 
m 
P= (Du) eT}, r= (Du) 
J i 


It is observed that (P;, P2) = P, * P2 when P; and P2 are independent, and 
hence the strong additivity implies the (mm, n)-additivity for all m,n > 2. The 
measures (10.61) are strongly additive in Q if, and only if, 


Hn(P\|Q) = b'An(Q) + cln(P||Q). 


Indeed, by considering (P;, P2) = (Z. 7 7 3), Pi= G, 3), Po = (5. 5); QO; 
Qo = (q,1-—4q), (¢ € J0,1[), it is easy to see that the measures (10.61) are 
strongly additive only when b = 0= cc’. 

“The value of J, is a minimum, for fixed P, when Q = P so that the error term 
(>| pi log pi/qi) is zero. It then reduces to Shannon entropy.” This motivates the 
minimum property (in the variable Q) 


eT°, forall m,n > 2. 


I n 
i j= 


Hn(P\|Q) > un(PI|P) forall P,Q € T°. 


Among the measures (10.61), those that possess the minimum property (in Q) 
are given by 


Mn(P||Q) = bAn(P) — b'Dn(Q\|P) +cln(PllQ), with b' <0, c = 0. 


Indeed, by considering the measures (10.61) with n = 2, we see that as a con- 
sequence of the minimum property, the map g > w2((p,1 — p)\l(g,1—4)) 
has derivative equal to zero at p, from which we conclude that b’ + c’ = 0 and 
80 fln(P||Q) = bAy(P) — b'Dn(Q||P) + cln(P||Q). Then the minimum prop- 
erty Mn(P\|Q) > “n(P||P) is equivalent to b'D,(Q||P) + cD;(P||Q) = 0. 
Since D, > 0, sup Dy (P||Q) = +00, and sup D,(P||Q) < +00, the inequality 


Pp 
—b'D,(Q||P) + cD,(P||Q) = 0 holds if, and only if, —b’,c > 0. Thus the 
measures (10.61) possess the strong additivity and the minimum property if, and 
only if, 

Hn(P||Q) =cIn(P\|Q), (2 9). 
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e Symmetric divergence or /-divergence. [498, 411, 468, 492]. The measures 
(10.61) are symmetric in P and Q (i.e., “n(P||Q) = “n(Q||P)) if, and only if, 
b=b', c=c'; and up(P||P) = Oif, and only if, b +c +b’ +c’ = 0. Thus the 
measures (10.61) are symmetric and have the property “the divergence is zero 
when P = Q” if, and only if, it is given by 


Mn(P||Q) = cLDn(P||Q) + Dn(QI|P)I. 


e Directed divergence. A stronger version of the sum property is the f-divergence 
introduced in Csiszar [185]. The measure ww, is an f-divergence if, and only if, 
it has the representation 


n(PI|Q) = >- aif (pi/4i) 


for some f : JO, co[ > R (f need not be convex as is supposed in [161]). The 
measures (10.61) are f-divergences if, and only if, 


Mn(P||Q) = cDn(P||Q) + ¢’Dn(QIIP). 
In fact, since 
bHy(P) + b'Hn(Q) + cln(P\|Q) + ¢'In(Q\|P) 
= —(b +c) | pilog pi — &' +c) Dai log pi 
+ cDn(P||Q) — b'Dn(QIIP), 


Ln is an f-divergence if, and only if, 
—(b +0) >) pilog pi — 6 +c) >) ai log pi = > aif (vi/ai) 


for some f. This is possible if, and only if, b+c = 0 = b’'+c’. Now the measures 
Hn(P||Q) = cDn(P||Q) + c’Dn(Q||P) have the property 


sup {n(P||Q) < co 
P 


if, and only if, c’ = 0 and consequently 


Ln(P||Q) = cDn (PQ). 


10.2g Sum Form Distance Measures 


Over the years, many distance measures between discrete probability distributions 
have been proposed. The goal now is to give a survey of all important characteriza- 
tions of sum form distance measures between discrete complete probability distrib- 
utions and to point out some related functional equations [478, 151]. 

Almost all similarity, affinity, or distance measures {Uy : re x rg — R, that have 
been proposed between two discrete probability distributions can be represented in 
the sum form 
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(10.54) n(P, QO) = >) b (PK. 4k) 


k=1 


where ¢ : J x I > Risa real-valued function on a unit square or a monotonic 
transformation of the right side of (10.54), that is, 


(10.54b) jul(P, 0) = 703 bina), 


k=1 


where wy : R — Ry is an increasing function on R, P,Q € T Ml The function 
¢ is called a generating function. It is also referred to as the kernel of n(P, Q) 
(Jones [397]). Some important examples of sum form distance measures between 
two discrete probability distributions P and Q in are: 


(a) Directed divergence (Kannappan [431], Kannappan and Sahoo [510], Kullback 
and Leibler [571]) 


(x, y) = x(logx — logy), 
Dn(P, Q) = >> pe log (7) 
k=1 dk 


(b) Symmetric /-divergence (Jeffreys [393], Kannappan and Sahoo [510]) 


P(x, y) = & — y)(ogx — log y), 
In(P, Q)= Vor — qk) log (“) 3 
kel qk 


(c) Hellinger coefficient (Bhattacharyya [124], Hellinger [364]) 


P(x, y) = JSxy, 
Hn(P, Q) = >) / Pea: 
k=1 


(d) Jeffreys distance (Jeffreys [393]) 
o(x,y) = (Wx -— VY’, 
K,(P,Q) = > (/ Pe - Va) 


k=] 


(e) Chernoff coefficient (Chernoff [151], Kannappan and Sahoo [512]) 


d(x. 9) = x%y", 


nN 
Cn,a(P,Q)= >) pig, aed; 
k=1 
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(f) Matusita distance (Matusita [626, 625]) 
il 
p(x, y) = [x* ~ y*|@, 


n 
1 
Mn,a(P, Q) = , IPe — ne 1%; O0<a<l; 
k=1 


(g) Divergence measure of degree a (Aczél and Daréczy [43], Chung, Kannappan, 
Ng, and Sahoo [164]) 


1 = 
P(x, y) = ree Gael er dF 
1 n 
Bn,a(P, Q) = | Sota = | , aFfl; 
k=1 


(h) Cosine a-divergence measure (Chung, Kannappan, Ng, and Sahoo [164]) 


d(x, y)= ; : — ./xy cos (« log (=) ; 
y 


n 
k 
: - >) VRE cos (« log )] ; 


k=1 


NIlR 


Nn,a(P, Q) = 


(i) Kagan affinity measure 


_ yy2 
d(x, y) = Gas) 
y 


n 2 
An(P,Q) = >-4k 1 - P| 
k=l dk 


(j) Csiszar f-divergence measure (Csiszar [184], Kailath [411]) 


ben year(*), 
y 
Zug? O= pif (“); 


= dk 
(k) Kullback-Leibler type f-distance measure (Kannappan [468]) 
P(x, y) =xIf@) -— FO), 
Ln,a(P, Q) = > new — f(gK)}: 


k=1 
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(1) Proportional distance (Van der Py] [812]) 
p(x, y) = min{x, y}, 


Xn(P, Q) = >) min{ px, ge}. 


k=1 


(m) Rényi’s divergence measure (Aczél and Daréczy [43]) 


d(x,y)=x%y'%, w(t) = —— logs, 


1 n 7 
SI oe (Dota ) a#l, 
k=1 


is a monotonic transformation of sum form distance measures. Rényi’s divergence 
measure is the logarithm of the so-called exponential entropy 


1 
n a-1 
Ea a= (= reat) 
k=1 


In order to derive the principle of minimum divergence axiomatically, Shore 
and Johnson [736] formulated a set of four axioms, namely uniqueness, invariance, 
system independence, and subset independence. They proved that if a functional 
Mn i T , x 2 — Ry, satisfies the axioms of uniqueness, invariance, and subset 
independence, then there exists a generating function (or kernel) 6: J x 1 > R 
such that 


Ria (P I Q) = 


n 
Un(P, QO) = >) b (PK. 48) 
k=1 
for all P, @ € T°. In view of this result, one can conclude that the sum form rep- 
resentation above is not artificial. In most applications involving distance measures 
between probability distributions, one encounters the minimization of “,(P, Q) and 
the sum form representation makes problems tractable. 


10.2h Kullback-Leibler Type Distance Measures 


A function Ly : 1 x re — Ry 1s said to be a distance measure of Kullback-Leibler 
type [507, 468] if there exists a generating function f : J > R such that 


(10.54c) un(P, O) = >» pal f (Pe) — £96) 


k=] 


for all P,Q € re. It is easy to see that the directed divergence D,(P, Q) is a 
Kullback-Leibler type distance measure with the generating function f(p) = log p. 
However, it is not obvious whether all Kullback-Leibler type distance measures are 
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essentially constant multiples of directed divergence. The following characterization 
result due to Kannappan and Sahoo [507] regarding Kullback-Leibler type distance 
measures says that if n > 3, then they are essentially constant multiples of directed 
divergence. 


Result (i). The distance measures of Kullback-Leibler type on T°? for fixed n > 3 are 
of the form 

Un(P, Q) = bD,(P, Q), 
where b is a nonnegative arbitrary constant and Dy; (P, Q) is the directed divergence 
between P and Q. 


In view of the definition of a Kullback-Leibler type distance measure, one obtains 
a functional inequality 


n n 
> pf Qe) < >. pet (Pe) 
k=1 k=1 
forall P,Q € re (with n > 3), which is considered in Chapter 16, on general 
inequalities. 
Result (ii). The distance measures of Kullback-Leibler type on r with a continuous 


generating function are of the form 


2 


1 1 
H2(P, Q) = > Pk t — pr)H (» _ ;) —(-—qx)H (« — ;)| 


k=1 


where H : - 5 | — R is an arbitrary continuous, increasing odd function. 


Note that, without the continuity assumption on the generating function, the gen- 
eral form of the distance measures of Kullback-Leibler type on i (that is, on the 
set of Bernoulli distributions) is not known. This is due to the fact that the general 
solution of the functional inequality 


xf(y)+U—x)f(—y) <xf@)+0—-—x)f0 —x) 
forall x,y € I is unknown. 


However, in general, 2(P, Q) includes measures other than D2(P, Q), where 
D2(P, Q) is the directed divergence between P and Q. For instance, H (t) = t yields 


2 
42(P, Q) = 22 Px(Pk — 94)(2 — Pe — 9k). 


10.2i Symmetric Distance Measures 


Many distance measures we listed above are not symmetric. If a measure is not 
symmetric, it can be easily symmetrized. For example, the divergence measure of 
degree a can be symmetrized by taking 
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Ina (P, Q) = Bna(P, Q) + Bna(Q, P). 


In explicit form, Jy,¢ is 


tal Pegn + 4%, “1-2 


Ina (P, Q) = qa-Tl _ |] 


Note that if @ — 1, then lim, Jna(P, Q) = Jn(P, Q). Characterizations of sym- 
a> 


metric divergence measures of degree a on closed domains can be found in [689]. 

In this section, we present some characterization results from Chung, Kannap- 
pan, Ng, and Sahoo [164]. We need the following definition to state the result pre- 
cisely. A sequence of measures {j,} is said to be symmetrically compositive if, for 
some 1 € R, 


Lnm(P *R,Q*S)+ Lnm(P *S,Q* R) 
= 2un(P, Q) + 2um(R, S) + Aun(P, O)um(R, S) 


for all P,Q ¢ T°, S,R ¢ T°. If A = 0, then {un} is said to be symmetrically 
additive. 

For applications of symmetric divergence to “pattern recognition” and “ques- 
tionnaire analysis,” the reader is referred to Ton and Gonzales [805] and Haaland, 
Brockett, and Levine [332], respectively. 


Result (iii). Suppose that a sequence of measures { {tn} has the sum form (10.5) with 
a measurable generating function @ : 17 —> R and is symmetrically compositive for 
all pairs of positive integers m,n (= 2). Then 


n 


un(P, 0) = > [pela log px + blog qx) + pe(a log qe +b log px) 
k=1 


if A = 0, or else 


1 n 
Un(P, Q) = > [Stet + phat) - | ‘ 


k=1 
or 
2 n , Pk 
Mn(P, Q) =—-—]1- Dprq) cos | a log —} |], 
A k=1 qk 
or 
2 
Ln(P, Q) = ~7 


where a, b, a, and f are real arbitrary constants. 
If the sequence { {1} is the measuring distance between P and Q, it is natural to 
assume that 
Un(P, P) =0. 
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Result (iv). A measure of distance {{1n} satisfies the hypothesis in Result (iti) and 
Ln(P, P) = 0 if and only if, 


Un(P, Q) =aJn(P, Q), 


or 
a-1_ 1 


2 
Ln(P, Q)= na (P Q), 


or 


4 
Hn(P, Q) = ~ZNn.a(P, Q). 


The proof of the previous two results relies on the solutions of the functional 
equations 


(10.62) f(pr.gs)+ f(ps,qry=r+s)\fPat Pt+Qgfes) 
and 
(10.62a) f(pr.gs) + f(ps,qr) = fDeOSF,s) 


for all p,g,7r,s € I (solution below). 


10.2j Some Functional Equations 


The solutions of (10.62) and (10.62a) were found in [164, 512] without any regularity 
conditions on f. A multiplicative map M : Ip — R has a unique extension to the 
multiplicative map M : ]0, co[ > R. A logarithmic map L : ]0, 1[ > R has a unique 
extension L : ]0, cof > R. 


Result (v). A function f : i — R satisfies the functional equation (10.62) for all 
D,q,1,8 € Io if, and only if, 
f (p,q) = PILi@) — Lo(p)) + g{Li(p) — L2q)I, 


where L,, L2 are logarithmic on ]0, o[. 


The following result yields the solution of the functional equation (10.62a), 
whose proof can be found in [164]. 


Result (vi). Suppose f : ie — R satisfies the functional equation (10.62a) for all 
P39,1,8 € Io. Then 


f (p,q) = Mi (p)M2(qg) + M1 (q)M2(p), 


where M,, Mz : Ip — Care multiplicative. Further, either M, and M2 are both real 
or Mz is the complex conjugate of M,. The converse is also true. 
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The functional equations (10.62) and (10.62a) can be generalized to 


(10.62b) filpr, qs) + fo(ps, qr) = gi, s)hi(p, g) + g2(p, ghar, 5) 


forall p,g,r,s € Io. Our problem is the following: Find all functions f\, f2, 91, g2, 
h,,h2 : Io > C satisfying the functional equation (10.62b) for all p,g,7r,s € Ip. To 
solve this equation, one studies a series of functional equations that are special cases 
of (10.62b). These equations are the following: 


(10.62c) f(pr.gs)+ fips, y= r+s)fRg+o+Ofes), [164]; 
(10.62d) f(pr,qs)+ f(ps,ar) = fp. dF. 5), [164]; 
f (pr, qs) + f (ps, ar) = 2f(p,9) +2f(r, 5); 
fi(pr,9s) + folps,qry= (+5) fa(p.g)t+ (P+a) fas), [503]; 
fi(pr,9gs) + fo(ps,ar) = BP. Df, 5); 
fi(pr, 9s) + fo(ps,ar) = fap. 9) + f4a@, 9); 
f (pr, qs) + f(ps,ar) = 8 )fp.g+ 8D. ah, s): 
f(pr,qs) + f (ps, ar) = g(r, s)h(p, 4g) + 8(p, Mar, 5); 
f (pr, qs) + f (ps, qr) = gir, s)hi(p, g) + 82(p, g)ho(r, 5); 
f (pr, qs) — f(ps,4r) = 317, shi (p,q) + 82(P, g)h2(7, 5). 


Solutions of equations (10.62c) and (10.62d) are known if they hold for all p, g, 7,5 € 
Io (see Results (v) and (vi)). The solutions of some of these equations on Jp are not 
known. 

We are not aware of any research work that characterizes all the existing sum 
form information measures in a systematic and unified manner. However, there are 
many research articles that deal with similarity, dissimilarity, affinity, or distance 
measures. The interested reader should refer to Basseville [105], Chung, Kannappan, 
and Ng [162], Van der Pyl [812], and references therein. 


10.2k Weighted Entropies 


The Shannon entropy gives us the measure of information as a function of the prob- 
abilities with which various events occur. But there exist many situations where it 
is necessary to take into account both these probabilities and some qualitative char- 
acteristics of the physical events. With this in mind, first we consider the following 
measures. Measures depending on the events themselves (mixed theory) will be con- 
sidered in the next section. 

Let (Q, A, ) be a probability space, and let us consider an experiment that is a 
finite measurable partition {C;, C2,..., Cn} (n > 1) of Q. The weighted entropy of 
such an experiment is defined by Belis and Guiasu [112] and in [331] as 


n 
(10.5) Hn(P; W) = — > wipilog pi, 
i=) 
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and the weighted entropy of degree f (4 1) introduced in [258] is 


n 
(10.5) HEP; W) = (2'-4 — 1)! > wip? — pi), 
i=1 
where p; = (C;) is the objective probability of the event C;, P € In, and W = 
(w1, W2,..., Wn) With w; € Ry; the nonnegative reals. 
A sequence of measures fy : Tp, Xx Ri — R (reals) is said to have the sum 
property provided there exists a function f : J x R, — R such that 


(10.63) fin(P; W) = >> f (pi, wi) 
i=l 


for all P ¢ T,, W ¢ R, where J = [0, 1] and the function f is called a generat- 
ing function. Evidently, the weighted entropies (10.5) and (10.5a) possess the sum 
property. 

The weighted entropy of degree / satisfies the composition law 


m m 


HEP * Q, U * V) = HAP, U) > 044; + HE(Q, V) So uipi 
(10.63a) r= a 


+ AHE(P,U)- H8(Q,V) 


for all P € Ty, Q € Tm, U € R%, V € R®, where 2 = (2! — 1)! This 
property is known as the weighted A-additivity of He. If 6 — 1, then (10.63a) 
yields 


m n 


(10.63b) — Himn(P * , UU * V) = Hn(P,U) + >) 0jqj; + Hm(Q,V)- >) uipi 
j=l =1 


(which also holds for 2 = 0). The property above is called the weighted additivity of 
Hp. 

If f is Lebesgue measurable in each variable in (10.63), then {,} is said to 
have measurable sum form. A sequence of information measures {{;} is said to be 
weighted (€, m)-additive of type (a, B) if 


m € 
(10.63c) — tem(P * Q,U * V) = > 0jq/ we(P,U) + > wip Hm (Q, V) 
j=l i=l 


forall Pe Tr, QETm, UE Ré, Ve R’, and a, 6 € R-— {0}. Notice that if 
a= 1= f, then (10.63c) reduces to the weighted (¢, m)-additivity. 

In axiomatic characterizations of information measures, one identifies some 
information-theoretic properties (such as sum form, recursivity, additivity, additivity 
of type (a, #), and so forth) and other relevant characteristics (such as monotonic- 
ity, boundedness, continuity, and so forth) that restrict the information measures to 
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some class of admissible functional forms. The objective is to seek the representa- 
tion of all sum form information measures that are weighted entropies. The weighted 
entropies possess the algebraic properties of “sum representation” and some form of 
“additivity”. Characterization of the weighted entropies by these properties led to 
the study of many “sum form” functional equations. 

The measures “, : T, x RY. — R are called weighted f-additive (weighted 
additive when £ = 1) provided 


m 


(10.63d)  fmn(P * O; W*V) = >) qjoj- MnP: w) +> of wi - Um(Q; V) 
j=l i=] 


forPe€ Tn, Q€Im, WER), V ERY. 

Note that the sum property in conjunction with the “additivity” and a regularity 
condition with some initial conditions characterize these measures. The properties 
of additivity and the sum representation led to the study of the “sum form” functional 
equation 
(10.64) 


n m m 


>> faa. wWjivj) = = Sa iL stnm+ Dol erie vj), 


i=1 j=1 


whose measurable (in each variable) solution f : J x Ry — Ris given by [448] 


apulogp+bpulogu, fP=1, 


eal areca BAI, 


where a, b, c are constants. 
A generalization of (10.64), 


n m 


> h(vig;. wivj) = = Die LHe m+ Deri ws 


(10.64a) thier 
>" 8(gj,0;), 
j 


treated in Kannappan [454, 478], has the measurable solutions h, g,k : 1 xR; —> R 
given by 

g(p,u) = apu log p+ bpu logu + ci pu, 

k(p,u) = apu log p + bpu logu + cpu, f=, 

h(p,u) = pu(alogp + blogu+c+c), 

g(p,u) = apu + bpu, 

k(p,u) = —bp!u + bp + cp +d, pe. 

h(p,u) = bput+ apu, 
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Another generalization of (10.64) is the equation 
(10.64b) 


n m 


>> f@ig;, wivj) = = Dae atm w+ Doom Py ati), 
yal 


i=] jy 


where a, f, y, 6 are nonzero reals. 
Result 10.20. (Kannappan [452]). Let f, g,h : I x Ri — R be measurable in each 
variable and satisfy (10.64b) for a fixed pair m,n > 3. Then they are given by 
f(x, u) = x%u8 (dlogx +alogut+tb+c)+aix+bhi, 
g(x,u) = x%u? (dlogx + alogu +b) + aox + bp, 
h(x, u) = xu? (d log x + alogu+c)+a3x +b, 


fora=y, BP =0, and 


f(x,u)= axtu? + bx? uo +ayx +b, 
g(x,u) = ax*u? + cx’ u® + arx +bo, 
h(x, u) = —cx%u® + bx? u® + a3x +b3 
otherwise, with a, + mnb, = 0 = a2 + nb2 = a3 + mb3, where a,b,c, and d are 
arbitrary constants. 
The sum form and weighted A-additivity led to the study of the functional equa- 
tion 


(10.65) 


n m 


YD vig: uivp)f (Vieni) 
i=1 j=l 


m m 


= Low D+" PLN. +AL FO. 1X say. 


i=l i=l 


where P « T? 


n? 


Qe Bea U € R, V € R%, and J areal constant. For 2 4 0, with 


F(p,u) = put+Af(p,u), 


(10.65) reduces to 


n m 


(10.65a) D> Fig; ui =D Fonm): 2 Faye 


jel js! 


for all P ¢ 2, Q € I), U e€ Ri, and V € R%, whose general solution f : 


Io x Ry —> R (holding for a fixed pair m,n (= 3)) is given by [503] 
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F(p,u)=A(p)+c, forp€lo, ue Ry, 


with condition (A(1) + mnc) = (A(1) +nc)(A() + mc), where A is additive and 
c is a constant. 
A sequence of information measures {/,} is said to be weighted (€, m)-additive 


of type (a, B) [506] if 


m 


t 
Hem(P * O,U *V) = > vjqh « ne(P,U) + > wip? um(Q, V) 
Pe! i=] 


forallP €T?, QET®, UER‘, V ER", anda, f e R— {0}. 

From the sum representation and the weighted (¢, m)-additivity of type (a, f) 
there results the functional equation 
(10.64c) 


m 


fi 
> > Fpig;. wiv) = > uip? - >> fF @j.0p) + > aig) >" (pis ui), 
J j i 


i=l j=1 i 


the general solution of which (10.64c), when f = 2, and m = 3, is given by [506] 


A(p) + au(p* — p*) ifa # Bp, 


aa be + L(p)up* + Li(ujup* ifa # B, 


where L is logarithmic on Jo, L1 is logarithmic on R, and A is additive on R. 


Result 10.21. [506]. Let uy : be xR > Ri = 2,3,...) have the sum form and 
satisfy the weighted (2, 3)-additivity of type (a, 6), where a, B € R — {0, 1}. Then 


n 
a> ui(ps — ph) ifa # B, 
Ln(P, U) _ n 
DD 4ipelL(pi)+ Dd ptuilitui) fa=B, 
1 


1= i= 
where L : Ip) > R, L, : Ry — Rare logarithmic functions and a is a constant. 


The measures obtained contain some of the known weighted information mea- 
sures. 


10.3 Mixed Theory of Information—Inset Measures 


In the last section, we considered measures (weighted) that depend not only on the 
probability but also on some qualitative characteristics of the events. Finally, in this 
section, we consider measures that depend not only on the probability of the events 
but also on the events themselves that were initiated and developed by Aczél and 
Daréczy [42, 51]. The resulting functional equations involve set-theoretic parts that 
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are difficult to solve. Now we present a brief survey of this mixed theory with some 
applications (see [42, 57, 51, 446, 22, 19, 23, 520, 246, 725, 633, 50, 453, 521, 239, 
236, 459]). 

Measures of information or uncertainty depend in the classical information the- 
ory solely on the probabilities of the events (an event may be the receipt of a message, 
an answer on a questionnaire, a gamble, an outcome of an experiment, etc.), not 
on the events themselves [43]. In the nonprobabilistic theory of information, it is 
assumed that the measure is determined solely by the events themselves. Now, 
a mixed theory of information is proposed, where measures of information may 
depend both on the events and their probabilities. 

One of the aims of the mixed theory of information is to find and characterize 
inset measures such as inset entropies, inset derivations, and other similar measures 
of information having certain useful properties analogous to those in the probabilis- 
tic theory of information (classical information theory). There is an obvious advan- 
tage in allowing the measures of information to depend also on the events and not 
just their probabilities. To this end, here information measures known as “entropy 
of a randomized system of events” or “inset entropy” are defined and studied as 
below. 

A nontrivial example of a previously known information measure that fits better 
into the new theory than into the classical one is provided later. It is shown how the 
inset entropies arise from entirely different assumptions in the theory of gambles. 
The theory of gambles deals with the problem of characterizing the preferences of 
an individual who selects one gamble out of a set of gambles. A gamble is a lot- 
tery, bet, or risky situation with various (specified) possible mutually exclusive and 
exhaustive outcomes, each outcome having a (specified) surjective probability that it 
will have occurred by some definite future time. The possible outcomes of a gamble 
are called prospects. The traditional solution of predicting the choice is for the indi- 
vidual to assign a utility number to each outcome of the gamble, multiply the utility 
number by the probability of the outcome, add the products to give the expected 
utility, and then choose the gamble with the largest expected utility. This method is 
called the Expected Utility Rule. If the function that computes the expected utility 
of the gamble from the assigned utility numbers is generalized, then these functions 
that give the new rule and still satisfy the basic properties of the gambles are to be 
determined. It turns out that all possible functions for the Generalized Utility Rule 
under the assumptions are given by the inset entropies or by the inset entropies of 


degree £ (6 (> 0) £ 1). 


Notation and Definition 


Let B be a ring of sets (containing with any two sets their union and difference and 
thus also their intersection and the 0-set) with its elements the “events” 


On = {X = (41, %2,...,%n) | xix; = Ofori Aj; x1 € B; i,j =1,2,...,n}. 


Then (X, P) € Q, x Ty, is called a randomized system of events. 
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Inset Entropy. An entropy of a randomized system of events, an inset entropy for 
short, is a sequence of mappings My : Qn x Ty, > R (nm = 2,3,...; R the set of 
real numbers). 
Various systems of postulates were used to characterize the measures: Shannon’s 
entropy, entropy of degree f 4 1, directed and generalized directed divergences, etc. 
Here we intend to consider their generalizations to inset entropy, inset entropy of 
degree /, inset derivation, etc. 


(i) Symmetry. An inset entropy “m is m-symmetric (m > 2) if 


=) _ oo) 

Lm = Hm 

P15 P25+++5Pm Pa(1)> Pa(2)>+++» Pa(m) 

for all (X, P) € Qm x Tm and all permutations a on {1, 2,...,m}. Symme- 
try means that the uncertainty given by the measure does not depend on the 
labelling of events. 


(ii) f-recursivity. The sequence fy is /-recursive if, for all n > 2 and all ran- 
domized systems of events (X, P) € Q, x In, it satisfies 


ll X15X2,-.--5Xn =U 1 KX D9 NG 52505 XH 
NB ae 2a iy "~"\ pi + po, P3,-++» Pn 


xX] x2 
+ (1+ pune) 
P\i+P2 Pirp2 


‘ 2 B . X1 X2 _ 
with the convention 0: « /00 a) 2) 0s 


(ii’) Recursive. The sequence jy is said to be recursive if 8 = 1 in (ii). 
Recursivity shows how splitting the event x; U x2 into the two events x; and x2 
changes the uncertainty. 

(iii) Measurability. An inset entropy is measurable if the function 


x1 x2 
Pe PIN gg 


is measurable on JO, 1[ for all fixed (x1, x2) € Qo. 
(iv) A deviation of a randomized system of events, an “inset deviation” for short, 
is a sequence K, : Q, x Ty, x Ty ~ R(1 =2,3,...). 


10.3.1 Characterizations 


The definitions above lead to the following characterization results [42, 51] regarding 
inset entropy, inset entropy of degree (6 ¥ 1), inset deviation, etc. 


Result 10.22. (Aczél and Daréczy [42]). The sequence ttn : Qn X Ty > R= 
2,3,...) is recursive (ii'), 3-symmetric, and measurable if, and only if, there exists 
a constant c € Rand a function g : B > R such that 
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n n n 
(10.66) Hn(X, P) = (U n)- > pig i) — ¢ >) pilog pi 
i=l i=l 


i=l 
forall (X, P) € Qy x Ty (n = 2) with the convention 0 - log 0 := 0. 
Remark. The last member in (10.66) is, of course, the Shannon entropy Hy. 
n 


If (J x; is the whole space Q (the certain event) (i.e., if the system x1,x2,...,Xn 


i=1 
is complete), then g(Ux;) = b is a constant and letting h(x) = b — g(x) in (10.66) 
gives 


n 
(10.66) Hn(X, P) = >? pih(ai) — ¢ D? pi log pi. 
i=1 i 


Thus, in the case of complete systems of events, the general recursive, 3-symmetric, 
and measurable inset entropies are sums of the expected value of an arbitrary random 
variable and of an arbitrary constant multiple of the Shannon entropy. 


Result 10.23. (Aczél and Kannappan [51]). There exists a function k : B > R such 
that 


(10.67) un(X, P) = >_ kx) pe — (Us) 


i=l i=l 
forall (X, P) € Qy x Ty with B £0, 1 and with the convention OP := 0 if, and only 
if, the sequence Lyn : Qn X Ty, > R (n = 2) is 3-symmetric and B-recursive (ii). 


Remark. If & is constant (a (21-8 — 1)~1, say), then (10.67) is a constant multiple 
of the entropy of degree /, HY (P). If we define g(x) := a(2!~4 — 1) — k(x), then 
(10.67) goes over into 


(10.67a) Hin(X, P) = «(Un)- Safecr+ anger 


i=1 i=1 


As 8 — 1, (10.67a) tends to g(Ux;) — >* pig (xi) + aHn(P), the result obtained in 
Result 10.22 [42], under the measurability condition, in the case 6 = 1, where H,, is 
the Shannon entropy. 


Remark. The main step to obtain (10.66), (10.66a), and (10.67) is to solve the 
functional equation 


fis)+d—s) fr (+) =Ad)t+d-o* fa (+) 


(see [442, 489, 495]), a generalization of the fundamental equation of information 
that is derived from the 3-symmetry and recursivity (f-recursivity) of Un. 
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10.3.1.1 Characterization of Inset Deviation Entropies 


To illustrate the interplay of sets in functional equations, we prove Theorem 10.24, 
which follows. 

An inset deviation is recursive if, for all n > 2 and all randomized systems of 
events 


X1,-+-5Xn 
Ply+++> Pn} € Qn x Tn x Tn, 
G15+++54n 
it satisfies 
X1,%X2,-++5Xn Hi UX N35 os oy Hn 
Kn | Pi, P2>+-+> Pn | = Kn-1 | P1 + P2, P3,-+-> Pn 
G15925-++54n Gi +42, 935+++549n 


xX] x2 
+ (pit p2)K2 | pi/(pi + p2) p2/(pi t+ p2) 
qi/(qi +42) 92/(qi + 42) 


with the convention 
Xi X2 
0- K2 {| 0/00/0}] =0. 
0/0 0/0 


The sequence {K,,} is m-symmetric (m > 2) if 


X1,-++5Xm Xr(1)>+++5Xr(m) 
Kin | Pls+-+> Pm |) = Kin Pa()>+++» Pa(m) 
W15+++54m da(1)> +++» Ya(m) 
for all 
X1,-++5Xm 
Pisy+++5 Pm | € Qn X Tm X Pin 
W15+++5dm 
and all permutations a on {1,2,...,m} (meaning simply that the uncertainty does 


not depend on the labelling of events). Finally, our inset deviation is measurable if 
the function 


Xi x2 
(t,u) > Ko{ 1—-t ¢ 
l-uu 


is measurable in each variable t and u on J = ([0, 1] x]0, 1[) U{(O, 0)} U{(, 1)} for 
each fixed (x1, x2) € Q2. We will prove the following theorem. 


Theorem 10.24. (Kannappan [446]). The sequence Kn : Qn x i —> R(n => 2) 
is recursive, 3-symmetric, and measurable if; and only if, there exists a function 
g:B— Rsuch that 
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Kissy n n 
Kn | Pis---> Pn =«(Ux)- re 
Gisciss Gn i=l i=l 
10. 
ee n n 
—a > pilog pi +c >. piloggi 
i=l i=1 
forall 
Ns rea Xn 
Pis+++>Pn} € Qn x Tn xX Th, 
J15+-++>4n 


where a and c are arbitrary constants. 


Remark. We follow the conventions 0-log 0 = 0; whenever g; = 0, the correspond- 
ing pj = 0. 


Proof. It is evident that any inset deviation given by (10.68) with arbitrary a,c € 
R and g : B — R is recursive, symmetric, and measurable. Now we prove the 
converse. 

We introduce a function F : Q2 x J > R by 


XxX, x2 
(10.69) F(x1,x23t,u) = Ko{1-—t ¢ 
l-uu 


Then, for each fixed (x1, x2) € Q2, F is measurable in each variable t and u. 
Recursivity for n = 3 and 3-symmetry has two important consequences. On the 
one hand, it shows that K2 is symmetric, too, which by (10.69) yields 


F(x1,x2;t,u) = F(x2,x1; 1—-1t,1—u) 


for all (x1, x2) € Qa, (t, uv) € J. On the other hand, they give 


x1 UX2 x3 x] x2 
Ko{ 1l-t ¢ }+Q—1t)K2{ 1-—s/U—t) s/A—-t) 
l-u u 1—v/d —u) v/(0 —u) 
XxX] X2 X3 


=kK3|{1-t—s s t 
l-u-vou 
XxX] X3 X2 
=kK3{1—-t-s ts 
l—-u-vu vo 
x,U x3 x2 XxX] X3 
= Kz} l-s s }+(U-—s)K2]{1-t/d-—s) t/Ud—-s) 
l-—v ov 1—u/(—v)u/d —v) 
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In other words, F given by (10.69) satisfies the functional equation 


F(x, Ux2,x3;f,u) + (1 —1t)F (1.20 — -.) 
(10.70) ; ‘ 
= F(x, Ux3, x0; 5,0) + (1 —s)F (sas —, =) ; 
l-—s l-ov 
for all (x1, x2, x3) € Qs, (¢, u), (s, 0) € ([0, 1] x JO, IP U {(0, 0)} withO <r+s < 
1,0<u+ov <1. 
For fixed (x1, x2, x3) € Q3 and uw, v in JO, 1[, (10.70) reduces to the equation 


(10.70a) Fit) +d —-)F (+) =Rigt+0—)h (+) 


fort, s € JO, 1[ withO < t+ < 1, with the notation 


F(t) = F(x1 Ux2, x3; t,u), F3(s) = F(x U x3, x2; 8, v) 


D u 
) , Fa(s) = F (sas Ss, ) 
—u l—b 


The general solutions, measurable on ]0, I[, have been determined for this equa- 
tion (10.70a) in [489], from which and from (10.70b) we obtain F>, F3, and F4, 
which we do not need here, and 


(10.70b) 


Fy(t) = F (1.20 tf, i 


F\(t) = F(x U x2, x3; t, u) 
= —a(x; Ux2, x3; u)(tlogt + (1 — ft) log — t)) 
+ D(x; U x2, x3; u)t + c(x, Ux] Ux, x3; u), 
so that 
F (x1, x3; t,u) = —a(x1, x3; u)(tlogt + (1 — t)log( — r)) 
(10.71) + b(x1, x3; u)t + c(%1, x35 U), 
(¢ € JO, 1[, uw € JO, 1[, (41, x2) € Qz). 


If we substitute (10.71) and (10.70) and compare the terms containing 
(1 —t)log( — ft) and s logs on the left- and right-hand sides, we obtain 


(10.72) a(x1 U x2, x3; u) = a(x, x2; 0)/(1 | 


a(x, U x3, x2; v) = a(x}, x2; v)/U — uv) 


foru,v,u+o € ]0, 1[. 
From (10.72), we have a(x1Ux2, x3; vu) = a(x1Ux3, x2; v) foru,v,ut+o € JO, 1[; 
that is, a(x, U x2, x3; u) is independent of uv. Thus, 


a(x1, x3; u) = a(x1, x3). 
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Now use (10.72) again to get 
a(x, Ux3, x2) = a(x], x2) = a(x, U x2, x3), 

which, first setting x; = 0, gives 

a(x3, x2) = a(0, x2) = a(x2), _ say, 
and then yields a(x2) = a(x3), where a is a constant, so that 

a(x, y;u) =a 

is aconstant. Thus (10.71) becomes 


F (x1, x3; t,u) = —a(tlogt + (1 —1f) log( —1)) 


(10.71a) 
+ b(x1, x3; u)t + c(x1, x33 U). 
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We substitute (10.71a) into (10.70), and the comparison of the coefficients of ¢ 


and the constant terms (terms free from ¢ and s) yields 


(10.72a) b(x, Ux2,.x33 u) — C | x1, x23 ae = b{x1,%x3; : 
l-u l—-ov 
and 


c(x; U x2, x3; u) +e (x1. — 
—Uu 
(10.72b) 


u 
=c (sas = -) + c(x1 U x3, x2; v), 


respectively. 
For fixed (x1, x2, x3) € Q3, by introducing the functions 


fl) = 6 + c)(1 U x2, x3; u), hu) = (b + c)(X1, x35 u) 


and 
k(u) = c(x®, Ux3, x2; u), (u € JO, If, 


from (10.72a) and (10.72b) there results a Pexider equation, 


f@=h (+) +k(v), foru,v €]0,1[ 


(that is, f(wz) = h(w) +k(1 — 2), withw = Fo ,z=1-0). 
The measurability of the functions implies 


k(v) = dj log — v0) + ad, 
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where d and d are functions of x; U x3, and x; that is, 
C(x, U x3, x2; u) = di (x1 U x3, x2) log. — uw) + d2 (x1 U x3, x2), 
or 
(10.72c) c(x1, x2; u) = di (x1, x2) log(1 — uw) + do(x1, x2). 
The substitution of (10.72c) into (10.72b) leads to 
dy (x1, x2) = di (x1, x3), di(xy U x2, x3) = dy (x1, x2) 


and 
dy(x1 U x2, x3) + d2(x1, x2) = do(x1 U x3, x2) + d2(x1, x3), 


the former yielding d; (x1, x2) a constant, c (say), and the latter for x; = 0 yielding, 
with the notation g(x) = d2(0, x), 


(10.72d) da (x2, x3) — d2(x3, x2) = d2(0, x3) — d2(O, x2) = (x3) — g (x2). 
Now, the symmetry of F and (10.7 1a) give 
C1, x2; u) = b(x2, x1; 1 -—u) +c(x2, x1; 1-4), 
which, together with (10.72c) and (10.72d), yields 


l-u 


b(x2, x1; 1—u) = clog + g(x2) — g(x1) 


or 

(10.73) b(x1, x2; w) = clog —— + g(x1) — g(x). 
Finally, use (10.72a) and (10.72b) to get 

(10.73a) c(x1, x2; u) = clog(| — uv) + g(x U x2) — g(%}). 


Thus, from (10.69), (10.71a), (10.73), and (10.73a), we obtain 


Xi X2 
Ko|{1l—u t ] = FQ, x23 t,u) 
l-uu 


(10.74) = —a(tlogt + (1 — 1) log(1 — 1)) 


+ c(tlogu + (1 —f)logd — u)) 
+ (g(x1) — 8 (x2))t + 81 U x2) — 81) 
for (x1, X2) € Qo, t € [0,1], uw € JO, LE. 
An examination at the boundary points (0, 0), (1, 1), (1, u), (@ € JO, 1[) with the 


help of (10.70) reveals that F satisfying (10.70) indeed has the form (10.74) for all 
(x1,.%2) € Qo, (t,u) € J. 
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This shows that (10.68) holds for n = 2. Suppose it is true for n — 1. Then, by 
the recursivity and (10.73), 


X1,%X2,--+5Xn x1 U%X2,%3,...,Xn 
Kn | Pi, P2>---» Pn | = Kn-1 | Pi + P2, P3,-+-> Pn 
G15 925+++54n Gi + 42, 935+++54n 

X1 x2 


+ (pi + p2)K2 | pi/(pi + p2) p2/(pi + p2) 
aqi/(qi +42) q2/(q1 + 92) 


= (xj Ux2---U Xn) — (pi + p2)g (x1 U x2) 


n n 
— >) pig) — a>) pilog pi — a(pi + p2) log(p1 + p2) 
i=3 i=3 


n 
+c >) piloggi + c(pi + p2) log(qi + 42) 


i=3 
1 D2 
+ (pi + p2) Ea U x2) — f g(x1) — i g(x2) 
Pit p2 Pit p2 
Pi Pl P2 P2 


log —a log ———— 
Pit p2 Pit p2 Pit p2 Pit p2 


Pi q1 P2 q2 
log ——— +c log 
Pit p2 qat+4q Pit p2 qi + q2 


nN n n n 
= (U x) — > pig(xi) — a >) pilog pi +c > piloggi 
i=1 i=l i=l i=l 


—-@a 


+c 


for all (41,...,%n) € Qn, (P1,---5 Pn)» (Gis--+59n) € Qn; that is, (10.70) holds 
also for n. This completes the proof. 


Remarks. The third member, >° p; log p; in (10.70), is, of course, the Shannon 
entropy. The fourth term, >> p; log gq; in (10.70), is the inaccuracy. For c = a, the 
third and the fourth terms, >” p; log oi in (10.70), are the directed divergence. When 
g is constant, the right side of (10.705 becomes 


a> pilog pi +b > piloggi +c, 


where a,b,c are constants, which include the above-mentioned measures (see 
[489]). For c = 0, the right side of (10.70) becomes (the inset entropy) 


n n n 
(U «) — >) pig(xi) — A>) pilog pi, 
i=1 i=1 i=1 


the result found in [42]. 
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10.4 Applications 


10.4.1 Continuous Shannon Measure and Shannon- Wiener Inset Information 


The usual measure 
DvD 

(10.75) -{ P(t) log P(t)dt 
u 


for uncertainty for continuous probability distributions (P is the probability fre- 
quency function) is, contrary to one’s first impression, not the limit of the Shannon 
entropy for discrete distributions 


n 
(10.75a) = >» p(t;) log p(ti). 
i=1 
It is the limit of — >> P(q;) log P(qi)(ti — ti-1), (gi € |ti-1, ti[); that is, with the 
appropriate choice of g; (i = 1,2,..., 7) and with the distribution function F(F’ = 
P, F(u) =0, F(v) = 1), the limit of 


PGy=f Ga) 
ti — ti-1 


n 
(10.75b) — IF Gi) — FG-1)) log 
i=1 
If, as usual, F(t;) — F (tj-1) = pj; 1s interpreted as the probability belonging to the 
n 
interval X; = |tj-1, til, i =1,...,2, U Xi = lu, v[= U, then (10.75b) goes over 


i=1 
into an inset entropy 


n 
(10.76) — >) pilog pi + >. pilog €(X;) 
i=l 
(where £(X;) is the length of X;). 
Contrary to (10.75a), the amount (10.76) is not necessarily nonnegative since 


(10.75a) may be positive for some probability distributions and negative for others. 
Forn = 1, X; = U = 0, 1[, (10.76) reduces to 


(10.76a) log €(U) 


as a measure of uncertainty if we know only that the value of the random variable 
falls into U but don’t know its probability distribution. If we know the distribution, 
the uncertainty reduces to (10.76). The difference between these two uncertainties is 
the amount, 


(10.77) 


n 
& (set) <wee(Ux _ T pilogecx 


=> pil oe 
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of information gained from the probability distribution. By Shannon’s inequality, 
(10.77) is nonnegative. Even more importantly, putting pj = 1, pj =O A j) in 
(10.77), we get 


X15 0-25 Xin- +5 Xn = €(X;) 
ee) (“Be gt) = — we ee 


as the measure of information gained from the knowledge that the value of the ran- 
dom variable lies in the subinterval X; of U. But (10.77a) is precisely the measure of 
information introduced by Wiener, which together with Shannon’s measures (10.75) 
and (10.75a) was at the source of information theory in 1948. For a characterization 
of “Shannon-Wiener Inset Information” (10.77), refer to [22]. 


10.4.2 Theory of Gambles 


Consider a gamble G with independent outcomes x1, x2,...,X, with respective 
probabilities pj, p2,..., Pn. Then X = (x%1,...,Xn) € Qn, Pp = (P1,---5 Pn) € 


T,, and denote 
G _ (X, P) — a) ; 
Pis,+++>5Pn 


Let U be the function that assigns utility numbers to the x;; that is, U : B > R. 
Then the Expected Utility Rule can be written as 


(10.78) U(G) = > piV (xi). 
i=l 


The Expected Utility Rule (EUR) for gamble is isomorphic to the ordinary rule for 
computing a marginal probability. If we were able to find a new utility rule for 
gambles, then this would presumably be isomorphic to a new rule for computing 
marginal probabilities. 

What utility of gamble rules are of interest? It is proposed that a utility rule for 
gambles should have a simple additive symmetry in order to be “interesting”. This 
rather weak condition, with boundary conditions for degenerate gambles, is shown 
to imply that the utility rule for a gamble should be a member of a specific class 
of functions. One member of this class is the EUR, and another is the “EUR plus 
Entropy”. 


(10.79) U(X, P) = D°(piV i) — bpi log pi), 


i=1 


where b is a positive constant. These utility rules were previously known. The 
remaining, previously unknown, members of this class have the form 


(10.80) U(X, P) = ¥ riven a(S of- ) (#1). 
i=1 i=1 
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Inspection of (10.78) and (10.80) suggests a generalization of the utility rule to the 
form 


(10.81) U(X, P) =>) f UG). Pi: 


i=1 


where f (U, p) is a continuous and “differentiable” function of U and p. Equation 
(10.81) has the desirable attribute of symmetry, so that the right-hand side of (10.81) 
is unaffected by any interchange of subscripts among the n pairs (U (x;), pi). There 
are other functions, not of the form (10.81), that also have this type of symmetry. 
One, quite obviously, is the rule 


(10.82) U(X, P) = [| fUGi), pi). 


i=1 


Note, however, that (10.82) can be converted into the form (10.81) by a suitable 
monotonically increasing transformation. This permits us to concentrate our atten- 
tion on rules of the form (10.81). There are two elementary properties that we would 
like a utility rule to have. First, if a prospect is impossible (i.e., has probability equal 
to zero), then we should be able to ignore it, so that 


U(x1, = ee »Xn—-15,Xn3 Pi; sa 9 Pn—1,9) = U(x, 25 »Xn-1> Pi; - 7 5 puoi). 
This, in (10.81), implies that 
(10.83) f(U (Gn), 0) = 0. 


Second, if a prospect is sure (i.e., has probability equal to one), then there is really 
no gamble at all, so we should have 


U(x1; 1) = U(x). 
This, in (10.81), implies 
(10.83a) f(U (x1), 1) = U1). 
We also make use of the compound gamble rule, 


nal = (V1, --+5 Ym G15 -++5 4m) 
U[(V1, ¥2s5 +++ Yims 1s ++ +s Gm), X25 +++5%n3 Pl, P25+++s Pal 
= U(y1, «. +5 Vy X25 66+5Xn3 PIs «++ 5 D1Gms P25 +++ Pn)s 


forallm >1,n>2. 

There should be a nontrivial dependence of the rule (10.81) on U(x). So, f 
should not assign the same expected utility to prospects that have been assigned 
different utility numbers. Thus f must be nonconstant in its first variable in its 
domain when its second variable is fixed. A certain amount of regularity is required 
for f. From these considerations is obtained the following result. 
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Result 10.25. (Meginnis [633]). Let U be a utility function defined on B, U : B > 
R and let f be a function defined by f : R x [0,1] — R that is nonconstant in 
its first variable when its second variable is fixed; i.e., f (-, p) is nonconstant and 
the second partial derivative in its first variable exists and is continuous. If U is 
recursively defined by (10.81), satisfying (10.83) and (10.83a), then the function f 
satisfies 


f(U (x1), pi) + f (U (x2), p2) 


10.84 
= #[ [7 (ven. Ee ) +4 (veo. = )} i+ po| 
Pit p2 Pit p2 


and U is given by 


> piU (i) + api log pil, c=1, 
(10.85) U(X, P) = 4 '=! 
d [pre -Awi- ep]. cA 1. 


L 


10.4.3 Recursive Inset Entropies of Multiplicative Type 


At the Twenty-first International Symposium on Functional Equations, in 1983, 
J. Aczél presented a list of unsolved problems in the theory of functional equations. 
Now we present the solution of Aczél’s Problem 10, described as follows. 

Let B be a ring of subsets of a given set Q = UB. Let B* = B\(G}, and for 
n=2,3,..., let 


Dy = {15 235s ss98n) | pe BR, ai Nae 0 foralle sy). 
Let k be any fixed positive integer, and 
Ij =10,1', D={(p,4)|p.4.p +4 € 1} 


A map g : D2 x 7 — R (the reals) is called a (k-dimensional) inset information 
function of multiplicative type on the open domain if, for some multiplicative map 
M: i, — R(e., M(pq) = M(p)M(q)), ¢ satisfies the equation 


gp (x1 U x2, x3, p)+M( a P)p (nF: 1) 
(10.86) s 


= (x1 Ux3, x2; q) + M1 — gy (x. G; -.) 

for all (x1, x2, x3) € D3 and all (p,q) € D. To avoid triviality, we assume D3 4 VY. 

The problem posed by Aczél is to find the general solution of (10.86), the multidi- 

mensional fundamental equation of inset information of multiplicative type. 
Equation (10.86) and various special cases of it have a long history, to which 

many authors have contributed. The solution to this problem leads to an axiomatic 
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characterization of measures of inset information J) (x1,...,%n,P1,---5 Py) that 
have the representations [239] 


f+ >i ai) - (Us) +L(P\)+ > 40i,B;) (for M = 1), 
2 1 1 
pean Mepn+SecsoMe)—F(Us LM) (for M additive), 
2 1 1 


f)M@) + >> g@i)M@) - AU x) (for M # 1 and not additive). 
2 1 


Here L is logarithmic, / is additive in the events and logarithmic in the probabilities, 
and f and g are arbitrary functions. 
A key step in the process is to solve the equation 


J (x1,.X2,.%3) + J (x1, x2) = J (x1 U x3, x2) + J (x1, x3) 


for disjoint triples x1, .x2,x3 of nonempty sets in B (a cocycle equation). A new 
construction was developed to handle the case where B happens to be an algebra. 


Branching Inset Information Measures (IIM) 


Definition 10.26. An inset information measure (J,), I, : Dn x Ip — R, has the 
branching property if there is a function A : D2 x i — R such that 


(10.87) £ oo) = oer ones ) okt @ 8) 
Pis+++5Dn Pit P2, P3,--+> Pn Pl p2 
for all (X, P) € Dy X Ty (rn = 3,4,...), 


e is 3-semisymmetric if 


X1 X20 X3 X1 X3 X2 
I = , (CP\SDxTy 
16 p2 5( P3 - re ac 


e is 4-semisymmetric if 


X1 X2 X3 X4 X1 X3 x2 X4 
a) 
P1 P2 P3 P4 P1 P3 P2 P4 


for all (X, P) € D4 x 14, and 
e is €-symmetric for some f € Z*, € > 3, if 


hy ee? _ 1, (). (X,P)eD, xT), 
Pis-++s Pe Pr(l)>++ +> Px(l) 


for all permutations z on {1,2,..., €}. 
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The branching property is one of the most fundamental axioms for any informa- 
tion measure having the following interpretation. In an experiment X with possible 
outcomes x1, ...,X, and corresponding probabilities p1,..., Pn, the left-hand side 
of (10.87) is the uncertainty of X. The right-hand side contains as first summand the 
uncertainty of the experiment Y with outcomes x; U x2, .x3,...,X», and correspond- 
ing probabilities p; + p2, p3,..., Pn. SO We are uncertain whether x; or x2 in the 
outcome will occur. To overcome the difference of the uncertainties of X and Y, a 
third experiment C will be performed to decide whether x; or x2 will occur. This is 
taken into account by the second summand A on the right-hand side of (10.87): A 
depends only on x1, x2 and pj, p2, which is a weak assumption. 

It is interesting that it is sufficient to use only the branching property and the 
3-semisymmetry and 4-semisymmetry to get an important characterization of an ITM 
Un). 

Assuming that (J,,) satisfies (10.87), the 4-semisymmetry of (/;,) leads to the 
so-called fundamental equation of IIM with the branching property 


A € Ux2 *) eR (* ) —. (*: Ux3 2) ah é >) 
p+qr Pq prr q pr 
for all (p,q,r) € D3 and all (x1,x2,x3) for which there exists an x4 with 


(x1, X2,.X3, x4) € D4 (a cocycle equation). 


Result 10.27. (Kannappan and Sander [521]). Let B be a ring of sets. An IM 
(in) on Dn X Ty is 3-semisymmetric and 4-semisymmetric and has the branching 
property if, and only if, there exist maps a: Di’ x i > R, b:D, x Bs — R, and 
c: Di! > Rsuch that 


In(X, P) = ala, pi) >) bi, pi) + (U «) 


i=2 i=1 


forall (X,P)€ Dy xn, n EN, n> 2. 
If, in addition, (In) is 3- and 4-symmetric, then (In) with 


n 
In(X, P) = >) bGi, pi) 
i=1 


has the so-called sum form property with generating function b. 


Remember that nearly all known information measures have this sum form prop- 
erty. Thus it is surprising that this result holds without any regularity condition. The 
proof is rather long and is based on many earlier works. 


11 


Abel Equations and Generalizations 


In this chapter, Abel’s equation, exponential iteration, associative and commutative 
equations, trigonometric equations, and systems of equations are treated. Generali- 
zations and connections to information measures are treated. 

Hilbert, in his famous address to the International Congress of Mathematicians 
held in Paris in 1900 [372], posed many unsolved problems. The second part of 
his fifth problem is devoted in general to functional equations and in particular to 
functional equations treated by Abel, who was the first to treat functional equations 
systematically. He presented a general method of solving equations of quite general 
forms. For the most part, he solved functional equations by reducing them to differ- 
ential equations. Hilbert posed the question as follows: 


Moreover, we are thus led to the wide and interesting field of functional 
equations which have been heretofore investigated usually only under the 
assumption of the differentiability of the functions involved. In particu- 
lar the functional equations treated by Abel (Oeuvres, vol. 1, pp. 1, 61, 
389) with so much ingenuity ... and other equations occurring in the lite- 
rature of mathematics, do not directly involve anything which necessitates 
the requirement of the differentiability of the accompanying functions. .. . 
In all these cases, then, the problem arises: In how far are the assertions 
which we can make in the case of differentiable functions true under proper 
modifications without this assumption? 


This chapter is devoted to the study of the equations considered by Abel and 
some generalizations applied in particular to information (theory) measures. 

Abel had four publications and three manuscripts [1, 2, 3, 4, 5, 6, 7] containing 
functional equations; see the nice, informative survey article by Aczél [33] giving a 
status report. Abel presented a general method of solving quite general functional 
equations by reducting them to differential equations. Keeping in mind Hilbert’s 
question, we will see how recent results succeeded in reducing the regularity suppo- 
sitions to weaker regularity conditions or nothing at all. 

Indeed, differentiability conditions used by Abel have been successfully replaced 
by much weaker conditions. Many researchers have treated various functional 
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equations without any regularity assumptions. Now we deal in depth with the equa- 
tions treated by Abel. 
Some of the equations considered by Abel are: 
(AFE1) f(z+w)=f@)fWw), f:C-C (exponential equation), 
(AFE2) f(¢(x))=f@)+1, f:@,b)—>R (iteration equation), 
(AFE3)  F(«,F(y,z)) = F(z, FQ, y)) (associative-commutative 
= F(y, F(z,x))= F(x, F(Z, y)) equation), 
= F(Z, Fly, x)) = FQ, F(, 2)). 


F:IxI-—- 1, I C Risa proper interval. 


x+y : 
(AFE4) f(x)+fo)=f (—) , x,y ER (arctan equation), 
(AFE5)  $(x + y) = GQ) fF) + F@)4(), 
¢,f:R-R, 


(AFE6)  yw(x+y)=g(xy) +h —y), 
WY, gh : R = R, 


(AFE7)  $(x) + O(y) = wOf(y) + yf), 
f:ROR, ¢:R OR, 


is a system of functional equations, and 


(AFE8) (x +y)¢(x—y) =) £0)? - 60) FQ), 
(AFE8a) f(x +y)fe—y) = fy fOY -7d@yd0y, f£¢:R? SR, 


x y _ y x 
amy ¥(-*2-25)-r(23)+¥(5) 


w(x) — w(y) — log(l — x) log — y), 
(AFE9a) w(x) + yw —x)=c-—logxlogd—x), forx, y €]0, 1[ 


(again a system). 


11.1 Solutions of Abel Equations 


To obtain the solutions of these equations, we treat the equations in the order listed 
above. 


11.1.1 (AFE1)—Exponential Equation f(z + w) = f(z) f(w) 


Abel justified “Newton’s binomial series” by using the exponential equation (AFE1). 
Abel solved this equation by assuming continuity and not differentiability. This 
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equation is solved under the weaker assumption of measurability (Kac [407], Aczél 
[33]). Obviously, f is zero everywhere or nowhere (see Theorem 1.36). We con- 
sider only the nontrivial solution. To start with, restrict f to R; that is, f : R — C. 
Clearly 


(11.1) Ifa+yIl=IfC)IFO), forx,y eR, 
and 
(11.2) Xa+y)=7XX)XV), forx,y €R, 


where v(x) = Sa with | y(x)| = 1; that is, y : R — C is a measurable character 
(it satisfies (E)). Since y is bounded, y is Lebesgue integrable. Integrate (11.2) over 
a toc with Fig x(y)dy = d # 0 (otherwise vy = 0 a.e.) to get 


ren / foie / eG Say: 


that is, 


1 x+c 
oeral god Ckoad. 


+a 


Since y is integrable, the right side is a continuous function of x and so is the left 
side y. Since y is continuous, it is differentiable. Differentiate (11.2) with respect to 
y and then put y = 0 to have 


bx 
> 


yi(x) =ay(x) andthen y(x) =ae forx € R, 


where a,b € C. This 7 satisfies (11.2) provided a = 1 and |y(x)| = 1 implies 
b=if, 6 €R. Thus 


x(x) = ef 


for some real f (see Theorem 1.38 and Hille and Phillips [376]). 
As to the measurable solution of (11.1), it is known (see also Theorem 1.37) that 


|f(x)| =e forx € Randsomea ER. 
Now, 
FQ) = fle tiy) = FO): fy) = et HH. errrtieay = ert 
for some c1,c2 € R, y, 6 € C (see also Result 1.41 and [44]). Thus, the measurable 
solution f : C > C satisfying (AFE1) or (E) is given by f(z) = e+, where 


c,déeC. 
For the general solution f : R — C of (E), see Result 1.40. 
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11.1.2 (AFE2)—Iteration Equation f (¢(x)) = f(x) +1 


The literature of this iteration equation is large (see [554, 560]). We quote here two 
results. 


Result 11.1. (Kuczma [552]). vi ft, : Ja, bl > R satisfy (AFE2). Suppose f 
is concave, a < f(x) < x in Ja,D[, fC) > 0, and eye of = 1. Equa- 


tion (AFE2) has a convex solution ¢ in Ja, b[. Further, ihis-s ee is unique up to 
an additive constant and is given by 


f(x) — f" (xo) 
oo Ff" (x9) — £0)” 
where xq is an arbitrary element in |a, b[ and f"(x) stands for the nth iterate of 


f (x). Also, the regular iterates of f (x) are given by f"(x) = d~'(¢(x) +u), where 
¢ is given as above. 


p(x) = lim 


11.1.3 (AFE3)—Associative, Commutative Equations 


Abel solved this system of functional equations by reducing them to partial differen- 
tial equations; that is, effectively using differentiability. The associativity equation 


(11.3) F(x, F(y,z))= FFQ,y),z), forx,y,zel, 


where /’ is a real interval, has been completely solved under continuity and cancella- 
tivity conditions without assuming differentiability or commutativity (see [12]; here 
again the literature is voluminous). Here we quote a result in Paganoni Marzegalli 
[658]. 

Result 11.2. Let I’ be a proper real interval and F : I' — I' be continuous, satisfy 
(11.3), and be cancellative ((A y € I’ : F(x1, y) = F(@x2, y) > x1 = x2 and4Ax € 
I’: F@,y1) = FQ, y2) > yi = yo). Then there is an f : I' > R that is 
continuous and strictly monotonic such that 


(11.3a) Fa, y=f'F@)+fO)) forxyel’. 
The converse also holds. 


Remark. Using cancellativity and F(x, F(y,z)) = F(x, F(z, y)), we get the 
commutativity F(y,z) = F(z, y), and then from F(x, F(y,z)) = F(z, F(x, y)) 
we get the associativity (11.3). Thus this system is referred to as the associative- 
commutative equation. 


For the solution of the commuting system 
F(x, F(y,2z)) = Fy, F(Q@,2)) forx,yeS, zeT, 


where S, T are compact under continuity of F : S x T — T and weak transitivity 
(see [65]), the result is as follows. If S is a commutative monoid, with c as unit, under 
an operation o, there is a continuous surjection f : S — T such that F(x,z) = 
Sf (x)oz (actually f(x) = F(x, c)). 
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11.1.4 (AFE4)—Arctan Equation 


This equation has no continuous solution on R except f = 0. There exist local 
solutions and global but multivalued solutions (see [536, 602]). 


11.1.5 (AFE5)—Trig Equation ¢(x + y) = $(x) f(y) +0) f(@) 


Abel solved (AFE5) without using derivatives. Note that this equation determines 
two unknown functions. The general solution of (AFES5) is obtained in Chung, 
Kannappan, and Ng [162] without using any regularity conditions (see Corol- 
lary 3.56a and [474]). 

Let S be a semigroup (not necessarily commutative) and F a field. Suppose 
& (€ 0) and f : S > F satisfy (AFE5). Then there exist exponentials E, E; and 
additive A such that 


$= a(E1- 6), f= (Eth) $=EA, fk. 


Regular (measurable, continuous) solutions of (AFE5) for ¢, f : R > R are 


: . 1 : : 
H(z) = ale —e), f@)= SM +e"); 6G) = dre™, fa) =e, 


where a, c, c1, d are constants. 

An equation equivalent to the cosine equation (C) was solved by Abel in [1] using 
derivatives. For the general solution of (C) on groups without using any regularity 
conditions, see Theorem 3.21 [424]. 


11.1.6 (AFE6)—y( + y) = g(vy) + h(x — y) 


Abel has given the differentiable solution [1]. The general solution of this equation 

is obtained for yw, g,h : F — G, where F isa field and G an Abelian group. While 

investigating the dependence of the quadratic difference f(x + y) + f(w — y) — 

2 f (x) —2 f(y) (which will be considered in Chapter 13) on the product xy and some 

generalizations, the authors came across (AFE6). First we give some preliminaries. 
Suppose y, g, 4: R > R satisfy (AFE6). Then 


(11.4) g(x?) — g(0) =h(2x) — h(0) = w(2x) — w(0) 
(11.4a) = (0) -— g(-x’) 
(11.4b) = g(x? — a”) + g(a”) — 2g(0) 


holds for x,a € R. Also, y(O) = g(0) + A(0). 
Set y = x in (AFE6) to have 


y (2x) = g(x) +h). 
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Put x = 0 and y = 0 separately in (AFE6) to get 


w(x) = sO) +h@), wo) =8O0)+h(-y) forx,y eR, 


showing thereby that / and thus y are even. 
So, h(2x) — h(0) = w(2x) — g(0) —h(O) = g(x?) — g(0), which is (11.4). 
Further, y = —x in (AFE6) yields 


g(0) = g(—x?) + h(x) = g(x’) — g(0) +h) + g(-x") 
or 
g(x”) — 80) = 80) — g(-x?), 
which is (11.4a). 
Finally, for any a € R, replace x by x +a and y by x — a in (AFE6) to obtain 
y (2x) = g(x? —a*) + h(2a); 


that is, 


g(x”) — 2(0) + w0) = g(x? — a) + g(a”) — (0) + h(0), 


which is (11.4a). 
Note that (11.4), (11.4a), and (11.4b) hold on a field to an Abelian group. Now 
we consider (AFE6) from Z7 to Z7 and prove the following theorem. 


Theorem 11.3. Let yw, g,h : Z7 — Z7 and satisfy (AFE6). Then the functions are 
given by 


g(x) = g(0) + x(g(1) — g(0)) = g(0) + A), 


2 
= ¢(2x2) — e(0 0) = A(x? ‘joe oe 0), 
(11.5) w(x) = g(2x*) — g(0) + y(0) (2x~) + w(0) (F)+v0 


x2 


io) = 42x) 0) 440 =A() +40, 


for x € Z7, where A: Z7 — Z7 is additive. 


Proof. Since the domain is small, the solution can be easily handled by direct 
computation. Any additive A : Z7 — Z7 is of the form A(x) = xA(1). Take 
A(1) = g(1) — g(0); that is, A(x) = (gC) — g(0))x, x € Z7. Letx = 3, a=6in 
(11.4b) to have 

8 (2) — g(0) = 2(g(1) — g(0)); 
that is, 

g(2) = g(0) + 2(¢(1) — g(0)). 
With x = 3, (11.4a) gives 


g(2) — g(0) = g(0) — g(-2); 
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that is, 


g(—2) = g(5) = 2g(0) — g(2) = g(0) — 2(gC1) — g(0)) 
= g(0) +5(g(1) — g(0)). 


Put x = 6 in (11.4a) to have 
g(1) — g(0) = gO) — g(-D): 
that is, 
g(6) = g(0) + 6(g(1) — g(0)). 
In (11.4b), take x = 3 anda = 2. Then 
g(2) — g) = 85) + g(4) — 28(0) 


so that 
g(4) = g(0) + 4(g(1) — g(0)). 
Finally, in (11.4b), take x = 2 anda = 1. Then 
g(4) — g(0) = g3) +8) — 8) 


gives 
g(3) = g(0) +3(g(1) — g(0)). 
Thus 
g(x) = g(0) + x(g) — g0))=g0) +A) forx € Z7. 


From (11.4), we get 
a? 
h(2x) = A(x?) +h(0) or A(X) =A 7 +h(0O) (replace x by 4x) 
since ; = 2, and 


2 
w(2x) = A(x?) + w(0) = (=) + y(0) forx € Z7. 


This proves the result. 


Indeed, solutions of (AFE6) from R — R or F — G are of the form (11.5). 
These solutions can be obtained by reducing them either to the Pexider equation 
(PA) or the additive equation (A). Note that this equation determines three unknown 
functions g,h, y. 
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Result 11.3a. [33]. Suppose g,h, w : R > R satisfy (AFE6). Then 


g(y) = g(ty) + Dbyl(gG) — g(@)), 


y? y? 
h(y)=e14—F¢ 9+] —| @O)-— ge) -g0) +40), 
(11.5a) 4 4 


ia y" 
viy) = (| =|) + | (g(1) — g(0)) — g)+ y(0), 


for y € R, where [y] and {y} are the integral and fractional parts of y € R, respec- 
tively. Further, the general solution of (AFE6) is given by (11.5). 


Proof. The proof of (11.5a) is from Kannappan and Bunder [481]. First we will show 
that 


(x2) = e({x7}) + be71(g(1) — g(0))_ forx ER. 


Note that x = [x] + {x} forx € R. 
In (AFE6), replace x by x + 1 and y by x — | to get 


yw (2x) = g(x? - 1)+A(Q2), 
which, by using (11.4), gives 
g(x?) — ¢(0) + w(0) = g(x* — 1) + g(1) — g(0) +20) 
or 
g(x?) = g(x* — 1) + g(1) — 8(0). 
If x? —1 <0, then 
g(x?) = g({x7}) +0: (g(1) — g(0)). 


Otherwise (for x2 — 1 > 0), 


g(x?) = g(x? — 2) +2(g(1) — ¢(0)) 


= g(x” — 3) + 3(g(1) — g(0)), if x? —2>0, 
= g(x” — [x7]) + [x71(g(1) — 2@)) 
= g({x7}) + [x71(g(1) — g(0)), forx ER. 


From (11.4a) and (11.4b), we obtain (take a = 1) 


g(—x”) = —g(x? — 1) — g(1) +3¢(0) 
= g(1 — x”) — (g(1) — g(0)). 


If 1 — x? > 0, then 


g(—x*) = g({-x*}) — (g(1) — g(0)). 
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Otherwise, for 1 — x? <0, 


g(—x*) = g(2 — x”) — 2(g(1) — 20) 
= g([x* ete x*|(g(1) — g(0)) 
= g({—x?}) +[-x7](e(1) — 2(0)) forx ER. 


Thus, 
gy) = s(fy) + yg) — g@)) foryeR. 

By replacing x by 5 in (11.4), we obtain h(y) and w(y) in (11.5a). This proves 
(11.5a). 

This equation was studied in Roscau [709] and Sablik [720], and the measurable 
solution is obtained in [720]. Now we determine the general solution of (AFE6) 
without assuming any regularity condition. 

Note that (11.4a) is the same as g(x) — g(0) = g(0) — g(—x) for x € R. From 
(11.4) and (AFE6), we have 


a = 2 
(Z) +1 = 00-6) v0) trnyen 


that is, 


gtu+v) = gu) — g(—v) + 8(0) 


(x- &ayy" 


by the transformation xy = u and = v for u,v € R. Note that the equation 


ay > 0. Hence 


above is valid for all u,v € R with v i u+o= 
gtu+v) = g(u)+g(v)—g(0) foru,v € Rwithvo,u+ov>0. 
Define 
(11.6) A(x) := g(x) — g(0) forx ER. 
Then the equation above becomes 
(11.7) A(tu+ov) = A(u)+ A(v) 


for all u, o € R, with v, u +v > 0. Now we extend A to all of R?. 


u+tv>0 ee ae 
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Choose u,v € R such that u + v0 < 0. Then there exists a real number x > 0 such 
thatx +u>Oandx+u+v0>0. 
By using (11.7), consider 


A(x) + A(u+v) = A(x +u+0) 
= A(x +u)+A(v) 
= A(x) + A(tu) + Av). 


Hence we have 
A(u +0) = Au) + AQ), 


for allu, vo € R, withu +o < 0. Thus A is additive on R. 
Therefore, by (11.6), we obtain 


g(x) = A(x) +, 


where a = g(0). Using (11.4), we get 


2 2 
vona a(S) rap and no=a(Z) + 


where £ = h(0). This proves (11.5) and the result. 


Corollary. [f any one of g,h, w in Result 11.3a is measurable (or continuous), then 
cx? cx? 
g(x) =cx +d, A@)=—- +5, UG y= ota +o, forx ER, 


where b,c, d are constants. 


(AFE6) on a Field 


The general solution of equation (AFE6) for maps y, g,i : F —> G, where F is 
a field belonging to a certain class a and G is an Abelian group, is obtained. The 
class a includes all finite fields, the rationals Q, and fields F with the property that 
for each x € F either x or —x is a square. This includes R and C. The functional 
equation (AFE6) is closely linked to the functional equation 


f@?—y)=f@)—fO") fors,yeF, fi F >G. 
For many fields, this forces f to be a morphism from the additive group of F into G. 


Result 11.3b. [146]. Let F belong to class a, char F 4 2, and F 4 z3, 25. Then the 
general solution of (AFE6) for y, g,h : F — Gis given by (11.5) for some additive 
A on F and arbitrary constants g(O) and g(1) in G. 


11.1 Solutions of Abel Equations 479 


11.1.7 (AFE7)—¢(x) + o(y) = WOxf(y) + yf (&)) 


Abel obtained the differentiable solution by repeated differentiation [1], where 
¢, f, w are unknown functions mapping reals into reals (without precisely stating 
the domains of these functions). This is one of the places where Abel noted the 
remarkable fact that one functional equation determines several unknown functions. 
The conclusion he drew from this fact is equally remarkable: 


Thus, it is generally possible to find all the functions by means of a single 
equation. It follows that such an equation can exist only very seldom. 
Indeed, since the form of an arbitrary function appearing in the given condi- 
tional equation, by virtue of the equation itself, has to be dependent on the 
forms of the others, it is obvious that, in general, one cannot assume any of 
these functions to be given. Thus, for example, the above equation could not 
be satisfied if f(x) had any other form than that which was found. 


It is not quite clear why Abel singled out this equation. He started, however, from 
the solutions 


f@)= 5, v(x) = p(x) = log, 
f(x) =V1—x?,) w(x) = d(x) = aresinx. 


These are solutions only on subsets of the real plane (the first on Ry ={(x,y)|x> 
0, y > O}, the second on {(x, y) | x, y € [—1, 1]} (there are also problems with the 
functions being multivalued in the second solution). So it is preferable to find the 
general (at least continuous) solutions on such subsets (say, on regions). 


Continuous Solutions of (AFE7) 


Let J’ be an interval in R containing 0, and y, ¢, f are real functions. For x, y € 
dom f/f, define 


A f(x,y) =xf(y) + yf @) 
and 
Al-(x) = A(x, x) = 2xf (a), 


the diagonalization of A. 

The case f(0) = 0 is not of much interest. It leads to the solution f arbitrary 
(continuous) for x 4 0, y on Af(J’, J’) is constant, and ¢ is constant. An inter- 
esting case is f(0) & 0. With F(x) = f(x)f()7!, 6%) = w(f()x) — (0), 
(AFE7) can be reduced to 


O(x) + O(y) = 0(x F(y) + yF(x)) = O(AF(@, y)), 


and this to 
F(Ar(Q, y)) = FQ@)F(Y) + cxy, 
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which is pointed out to be essentially the associative equation (11.3) for 
Ar(x, y) =xF(y) + yF(x) 


and that if assumed for all x, y € R and if c > 0, this equation is equivalent to the 
“Baxter equation” 


h(An(x, y) —xy) =A(x)h(y). 


For linear operators on Banach algebra, the equation above defines the Baxter (or 
summation) operators connected to queuing theory [635]. 


Result 11.4. (Sablik [718]). Let I © R be an interval containing 0. Let y : 
Af’ x I') > Rand ¢ : I > R be functions and assume that f,@, y are con- 
tinuous solutions of (AFE7). Then (AFE7) has a list (seven) of lengthy, complicated 
solutions to be reproduced. 


Corollary. [718]. [f either ¢ or y in Result 11.4 is locally bounded from above (or 
below), then the assertions of Result 11.4 hold. 


In [718], the author obtains the solution of (AFE7) when 0 is not in the domain 
I’, assuming instead that 0 is a value of f; that is,0 € f(/’). 


11.1.8 System of Equations (AFE8) and (AFE8a) 


d(x + yo — y) =o) FO)? — 60) FQ), 
faty)f@-y) = fA fOY — cd)6O). 


In an unfinished manuscript, Abel reduces the system of functional equations 
(AFE8), (AFE8a) to differential equations of fourth order assuming f,¢:C > C 
are analytic. These equations are related to elliptic functions. In [348], entire solu- 
tions of (AFE8) alone are determined. 


Result 11.5. Suppose ¢, f : C > C are entire functions satisfying (AFE8). Then 
the solutions are given by 


(i) 6 =0, f arbitrary; 
(ii) d(x) = aye g(x), f(x) = +e*o;(x), @ = 1,2) oj cosine functions in 
the theory of elliptic functions; 
(iii) $x) =aixe®™, f(x) = te’; 
(iv) d(x) = aye2* sina3x, f(x) = +e”: 
(v) d(x) = aye sina3x, f(x) = +e%*" cos X3X, 


forx €C, aj, a, a3 are constants, and o, 0; are elliptic functions. 


Now we discuss continuous solutions of the system. 
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Result 11.6. (Bonk [127]). Suppose ¢, g : R > C are functions that satisfy 
(11.8) Pu + v) PU —v) = $(u)’gv) — POs, u,v ER". 
If @ is continuous, it is one of the following functions: 


plu) = ae (a(u)), 
d(u) = ae sina(u), 


o(u) = aet™a(u) forueR’, 


where o denotes the elliptic function, a,q : R" — C are given by 


n 


A(X1,-..,%n) = > cixi, Cr EC 
i=l 


(an R_ linear map), 


n 
q(X1,.--,Xn) = > Cij Xixj, ciy €C, 
i,j=l 


(a complex-valued quadratic form on R"), and a is a constant. 
Given ¢ 4 0, the function g is not uniquely determined by the functional equa- 
tion (11.8). Ifc,,c2 € C and ug € R" are suitably chosen, 


g(u) = cip(uy* +cod(u+uo)d(u—uo), uéeR". 


Continuous Solutions of the System 


As an application of Result 11.6, we get the continuous solution of the system 
(AFE8), (AFE8a). 


Corollary 11.6a. Suppose ¢ : R" > Cand f : R" > Care functions andc € C 
is a constant such that the equalities (AFE8) and (AFE8a) hold for all u,v € R". 
If 6 € 0 is continuous, then ¢ is one of the functions listed in Result 11.6 and f and 
c? are 

Gi) fu) = Lexp(q))ox(a(u)), c* = (e, — en)(ex — ev)/a*, (A, u,v} = 

{1, 2, 3}, 

Gi) f(u) = 4 exp(q(u)) cos(a(u)), ¢? = 1/a* or fu) = £exp(q(u)), ¢ = 0, 

(ii) f(u) = £exp(q(u)), c= 0, 


in the cases (i), (ii), and (iti) of Result 11.6, respectively. 
Conversely, every triple ¢, f,c?, as given above, satisfies the functional equa- 


tions (AFES8), (AFE8a). 
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If ¢ = 0, then the system (AFE8), (AFE8a) reduces to 
(11.9) flut+tv)ftu—v) = fu’ fo), u,v ER". 
The continuous solutions f : R” — C of (11.9) are given by 
fWwe= aeM™,  foru eR", 


where g : R” — C is the complex-valued quadratic form 


n 


q(x1, tee »Xn) = es AjjXiXj, 


ij=l 


and aj;;, a are constants. 


11.1.9 (AFE9) and (AFE9a) 


Finally, we consider the system (AFE9), (AFE9a), which connects to information 
measures that will be dealt with in the next section. 


ee 
ee a) = = y { ——— 
(apeoy "Nie Ty) \i—a) Nia 


w(x) — w(y) — log(l — x) log — y), 
(AFE9a) w(x) + yw —x)=c—logxlogi—x), x,y €]0,1]. 


There are so many functional equations that are connected to characterization of 
various measures of information and in particular with Shannon’s entropy (see Chap- 
ter 10). One of the equations one comes across in the characterization of Shannon’s 
entropy is the fundamental equation of information (FEI). 

First we give some notation. Denote 7 = [0,1], Jo = JO, 1[, Jo = i U 


(0,0), (1, D}, Js = 13 U{,0,0), (1,1, D), Le) = fk (PEE + SU-2) at, for 
x € Ip, and a a fixed element in Jo. 


Abel [1] considered the series 
2 n oO Ln 


X x x 
waaxt Geet ate =D. 


which converges for |x| < 1 in connection with (AFE9), (AFE9a), 


x oe log(1 — x 
A ee a ee £eie te 
n x 


So, 
w(x) = -| Wathen ®) 5. 


Xx 
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1 


He has given so many properties of y and values of y at a few points 1, 5, 


iT 
waa) = f 28 ay, 
1l-x 


etc., 


so that 

v(x) + wl —x) =c — logx log(1 — x), 
which is (AFE9a) (this can be obtained from (AFE9) by putting y = 1 — x and 
c = w(1) when | is in the domain), and 


$1.2 
c=yw(l)= ae ee 
Tn 6 


w(x) + w(-x) = sue), 


so that 


1 2 
v (=) = - (og2)’. 


It appears from Ramanujan’s notebooks [677, 341, 342] that he had found the 
values of 
* log(1 — ft) Sx 
(x) | ; pa = 


[1] for the special values x = 5. 25 5(V5 — 1) given by Rogers’s [707] result that 
x — 1-x 
Fis) + FQ) = Fay) — F (*C) — (202) 
1-—xy 1-—xy 


is an elementary function without the knowledge of these writers. Ramanujan dis- 
covered that F satisfies the functional equation 


F(a) + FQ) + FU =ay) +F (72) + F( —*) =3r«, 
— xy 1-—xy 


This was rediscovered by Rogers [707, 342]. This is to be found in [1]. In [535], the 
author reduces the system (AFE9), (AFE9a) to the equation 


(11.10) P(x) + FQ) + Flay) +F (Z—*) 4 F(T) <0 
1—xy l—xy 


for x, y € I or Jo, which is known as the Abel functional equation. The solution of 
(11.10) connected to (FED) is in the following theorem. 
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Theorem 11.7. [33, 206]. The general integrable solution F : I > Rof (11.10) is 
(11.10a) F(x)=cl(x), forx eT, 


where c is a constant. 


Proof. The solution is obtained by showing first that F is continuous, then dif- 
ferentiable, and then has derivatives of all orders. By using the derivative of F, 
(11.10) can be reduced to (FEI) for x, y € Jo, the fundamental equation, satisfied by 
f(x) =x(1 — x) F’(x). This connects to the Shannon entropy. 

To deduce the continuity and from it the differentiability of F from integrability, 
integrate (11.10) with respect to y from 0 to | and obtain 


1 1 
Fay=- fo Foydy- =f F(t)dt 


F(t) i: : FO 
ce rf =a ae a 


Since the right side is continuous in x on JO, I[, so is the left side; that is, F(x) is 
continuous. Then the right side is differentiable, and so is the left side. It is easy to 
see that F has derivatives of all orders (we need only up to order 3). Differentiate 
(11.10) with respect to x for x, y € Jo, and get 


; , l-y »(1-x wil=)) 27 12 = 
i aa le ae aa ee al (—)+“r (=) -o. 


which goes over to 


1-x 1-x l-y 
fy F( )- f= ay) taf ( ) =o. 
1—xy 1-—xy 1—xy 


where 
(11.11) f(x) =x(_—x)F'(x) forx € Ip. 


With the substitution u = 1—x, v=1—-— this equation transforms to (FED, 


=n 


(11.12) f(—u)+0—w)f (=) =f 4d-oF(T), 


foru,v,u+ove€eIlo. 
Set y = 0 in (11.10) to have 


F(x)+ Fd —x)=—F(0) -2F(), 
which, after differentiation, yields 


F'(x) = F’—x) forx € Io. 
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Thus (11.11) shows that 
fx) =fd-x) forx € Ip 


(f is symmetric), and (11.12) yields (FEI). 
The general measurable (integrable, continuous) solution of (FEI) is given by 
(see Aczél and Daréczy [43], Lajko [598], and Kannappan and Ng [489]) 


(11.13) f(x) =cs(x)+ax+b_ forx € lo. 


Now, since f(x) = f(1— x), because F is defined at 0 and 1,a = 0 = band (11.11) 
results in (11.10a). 
This proves the result. 


Remark 11.7a. If the domain of F is ]0,1[, thena 4 0, b 4 0 in (11.13). Substi- 
tution of (11.13) in (11.12) yields b = —2a, so that (11.11) gives 


x2 


> 
=X 


F(x) =cI (x) —alog 


and for x € Jp (linking F to the dilogarithm (Kiesewetter [535])) (11.10) has more 
solutions on Jo than on J (on smaller domains, a functional equation may have more 
solutions). 


Remark 11.7b. The Abel functional equation (11.10) is connected to (FEI) and to 
Shannon entropy. 


In the next section, we study generalizations to (11.10) and connections to other 
information measures. 


Problem 1. The general solution of (11.10) is not known. Determine the general 
solution of (11.10) (without using any regularity condition) on [0, 1] or ]0, I[. 


Summary. Either general solutions (without any regularity assumption) or solutions 
with weaker regularity conditions such as integrability, measurability, etc., of Abel’s 
equations are obtained, eliminating differentiability and thus answering the second 
part of Hilbert’s Fifth Problem. 


11.2 Generalizations and Information Measures 


In Chapter 10, we have seen that the well-known measures of information, such as 
the Shannon entropy, the directed divergence, and the inaccuracy, are characterized 
through sets of axioms with the help of several functional equations. One of the 
functional equations that arises in these characterizations is the fundamental equation 
of information theory. In the previous section, we have connected them to the Abel 
equation (11.10). Now we treat many generalizations of (11.10) and link them to 
other measures of information. We start off with the first generalization, 
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l-u 
) + KG =u +P ( ) =o. 
l—uo 


for u,v € I, where F,G,H,K,P : I — R, which is a generalization of (11.10), 
can be solved by reducing it again to (FEI). 


Theorem 11.8. (Kannappan [440]). Let F,G,H,K,P : I — R satisfy (11.14). 
If any one of the functions is Lebesgue integrable, then so are the remaining func- 
tions. Further, all these functions are differentiable in Ip, and the solution of (11.14) 
is given by 


(11.14) FWw)+GQ)+H ( 


u 
1l—uo 


F(x) =al(x)+ cq, 
G(x) = F(x) +2, 
H(x)=—-F(1—x) +03, 
K(x) = —F(1—x)+¢4, 
P(x) =—-F(1-x)+6¢s, 


for x € Ip, where a € Ip and the c;’s are constants. 


Proof. The solution of (11.14) is obtained first by eliminating four of the five 
unknown functions and rewriting it as an equation of the form (11.10) and then 
reducing it to the fundamental equation (FEI). 

Let F,G,H,K,P : [0,1] — R satisfy (11.14). v = 1 in (11.14) gives 


(11.15a) Kt) = —F0 —4) — GQ) — #01) = PO), 


and v = 0 in (11.14) yields 


(11.15b) H(u) = —F(1—u) — G00) — K(1) — P(1). 


Now u = | in (11.14) and (11.15a) gives 


(11.15c) G(v) = F(v) + G(1) + H(1) + P(O) — FC) — HO) — PC), 


and use (11.15c) and (11.14) with u = 0 to get 


P(») = —FU — 0) — G0) —2H(01)— PO) 
+ F(1) + H(0)+ P(1) — F(0) — K(1). 


(11.15d) 


Now (11.15a) with wu = 1 and (11.15b) with u = 1 give 
(11.16) -F(0) = K(1)+ G(1)+ H(01) + PO) = H0)+ G(0)+ K() + Pd). 


With the use of (11.15a), (11.15b), (11.15c), (11.154), and (11.16), (11.14) reduces 
to 


u(1 — v) 


(11.15) F@)+F@)— F( )- Heike (e - 
— Uv 


1l—uov 


; ) + FO =0. 
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Define 
(11.17) N(x) = F(x)-— F(O), x € [0,1]. 


Then (11.15) and (11.17) give 


(11.18) ww + No) = 6 (2) — vy (2 2) =o, 
1l—uo 1—uov 


an equation encountered before. 

Suppose any one of the functions, say F’, is Lebesgue integrable. Then K, H, G, 
and P are also Lebesgue integrable and N is also integrable. 

Integrating equation (11.18) with respect to v between 0 and 1, we get 


_ | l—u [“ N(t) 
N(u) = — a N(o)do + —— , a—1t 


N 
-=[ wood —a—w [ rad ‘ 


It is evident that N is continuous on ]0, 1[ and also that N is differentiable in 0, 1[. 
Thus N has derivatives of all orders (with the argument as in Theorem 11.7). 
Differentiating (11.18) with respect to u, we obtain 


dt 


N'(u) — Py Co ~) —vN' (uv) 
(1 — uv)? 1—uo 
(11.19) 
v(1 — v) N’ (< —”) =" 
"7 (1 — uv)? l—-uv ) 
forO <u, v <1. 
Define 
(11.20) f(x) =x(_—x)N'(x) 


for x € ]0, I[. Note that N’(x) = F’(x). Now (11.19) and (11.20) yield 


1 1- 1- 
f= 6 (2) — I pw) uy (E*) = 


for u,v € JO, 1[, which, by the substitution 


l-—u 


x=1l-u, y=l1- - 
1—uov 


reduces to 


(11.21) fl—x)-fG)-C-y) r(1 — )+a-of (1-4) =0, 
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forO <x, y <1, withx + y < 1. For y = x, from (11.21) we get 
1 
(11.22) f(x) =fd-~x) ford See 


Hence (11.22) holds for all x € JO, 1[, so that (11.21) can be rewritten as 


x 
FED f(a) + (1 -x)f (-) =f0)+0-yf (=) 
forx,y,x+y € ]O, I[. 
As N’ is continuous, f is defined by (11.20). Then it follows from [43, 489] that 
Ff (x) = cs(x) and that 


N(x)=c ae a log(1 — 2) 

1-x x 
for x € JO, 1[, so that from (11.17) it follows that F(x) = cI (x) for x € [0, 1], where 
c is an arbitrary constant. The functions K, H,G, and P can be determined from 
(11.15a), (11.15b), (11.15c), and (11.15d), respectively, using F(x). This proves the 
theorem that describes the solution of (11.14). 


Remark. If F,G,H,K,P are defined only on Jo, there exist more solutions of 
(11.14) [441]. 
For example, 


a 
F(u) =a log = G(u), 
l-u 
et? 
Hj Seiee = Plu), 
u 
u 
K(u) = —2a log G=0 
and 
F(u) = alogu?(1 —u) = G(u), 
u 
H(u) = ae ae = Plu), 
K(u) = —2a logu, 
as well as , 
F(u) = log —— = Glu) = Hw) = Plu) = Kw), 


for u € Ip, are solutions of equation (11.14). 


Problem. Determine the regular solution of (11.14) on Jp and the general solutions 
on J. 
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Now we consider other generalizations of (11.10) linked to other measures of 
information and (FEI). First we treat the functional equation 


l-u l-s 
Flu,s)+F(v,t)+F a) 

l—uv 1-—st 
(11.23) 


l1—v 1-t 
+ FC — uv, 1 —st) + F | ———, —— } =0, 


l—uv’ 1—st 
which is a two-dimensional generalization of (11.10). 


Result 11.9. (Kannappan [447]). Let F : Jz > R be Lebesgue integrable in the 
first variable and satisfy (11.23). Then F(x, y) = al(x) + L(y) logx — LU — y) 
log(l—x)+P()), where P satisfies (11.10) and L satisfies the logarithmic equation 


(11.24) L(l—xy) =L(1—x)+L(—y). 


Further, if F is Lebesgue integrable in the second variable also, then F is given by 


2 


F(x, y) = al (x) + bI(y) +c) logx log(1 — y) — c; log(1— x) log y — ca log = 


for (x,y) € ja. 
The solution is obtained by first showing that integrability implies continuity, 
which in turn implies the partial derivative, and then showing that F has partial 
derivatives of all orders with respect to the first variable. Then reduce (11.23) to the 
two-variable fundamental equation (with f(x, y) =x( —x)Fi(x, y)) 
(10.49) 
u D x y 
f@syyr OU =—2)f rR = flu,v)+U—u)f a aye 2 
1-—x l-y l-u 1l- 


D 


which links to the directed divergence and the inaccuracy [43, 535]. 
Next we consider a generalization of (11.23), 


l-u Il-s 
Flu,s) + Gv, t) + H | ——., 
l—uv 1-—st 
(11.25) 


l—v 1-t 
+ K(1—uv,1—st)+P ; = 0, 
l—uv 1-—st 


where F,G,H,K,P:J2 > R. 


Result 11.10. (Kannappan [461]). Suppose F, G, H, K, P : Jz — R satisfy (11.25) 
and any one of the functions is integrable with respect to the first variable. Then 
the remaining functions are also integrable in the first variable and all have partial 
derivatives of all orders with respect to the first variable and 


F(x, y) =al(x) + L(y)logx — L( — y)log(d — x) + By) +e 


for x,y € Io, where L satisfies (11.24) and f satisfies an equation of type (11.10). 
If the functions are integrable also with respect to the second variable, then 
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F(x, y) =al(x) + bI(y) 


2 
+ ci[logx log(1 — y) — log(1 — x) logy] — cp log = 
-y 


+3 


and the other functions differ from it by constants. 


Remark. 
a 52 
F(u,s) = alog + blog = Glu,5s), 
l-u l-s 
2: 2 
ie 
H(u,s) =alog . i digg Gy, 
l-u KY 
u2 Ss 
K(u,s) =a log —— POE ase’ 


etc., is a solution of (11.25) on i (on smaller domains, a functional equation may 
have more solutions). 


Problem. Determine the regular solution of (11.25) on i 


Finally, we consider the three-dimensional version of (11.10), namely 
(11.26) 
l-u 1-y I1-t 
F(u,y,t) + F(v,s,w) + F | ——, —— 


l—uv’ 1—ys’ 1—tw 


l1-—-v 1l-s l1-w 
+ Fd -—uv,1—ys,1—tw) + F | ———, ——, ——} = 0, 
l1—uv 1l-ys tw 


and 
(11.27) 


l1-u 1-y I1-t 
Flu,y,t)+ Go,s,w)+H ; ; 
l1—uv 1l—ys 1—tw 


l1-—v I1-s l-w 
+ K(U—uv,l—ys,1—tw)+P ; ? = 0, 
l1—uv 1l—ys 1—tw 


where F,G,H,K,P:J3 > R. 
Result 11.11. [461]. Suppose F defined on J3 — R is Lebesgue integrable in each 
variable and satisfies (11.26). Then F has partial derivatives of all orders and is 
given by 
F(u,v,w) =al(u)+bI(v) + CI (w) + Ci og( — v) logu — logo log(1 — u)) 
+ C2(log(1 — w) logv — log w log( — v)) 
+ C3(log(1 — u) log w — log(1 — w) log u) 


foru,v,w € Ip. 
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The solution is obtained by connection to the fundamental equation (of three 
variables) 


flny.d+0—oF( 7 4) 
l-x l-y 1l-z 
(10.59) 


= Fluo.w) +0 ws ( is os): 


l—-u’l—-v’ 1l—-w 


and thereby it connects to the generalized directed divergence. 
Now we will determine the integrable solution of (11.27). 


Result 11.12. [461]. Let F,G,H, K, P : J3 > R satisfy (11.27) with F integrable 
in the first argument. Then all are integrable in the first variable and are given by 


F(x, y,2) =al(x) + L(y, z)logx — LI. — y, 1 —z)log(1— x) + 80,2) + C1, 
H (x,y,z) = al (x) + L(y, z) logx — L(V. — y, 1 —z) log(1 — x) 
=~ (Wy, 1=e) Cs 
K(x, y,z) =al(x)+ L(y, z)logx — LU — y, 1 —z)log(1 — x) 
=p0=—y,l—Z) + C3, 
G(x, y,z) =al(x) + L(y, z)logx — LU — y,1 —z) log — x) + BO, z) + Ca, 


and 
P(x, y,z) =al(x)+ LYy, z)logx — LU — y, 1 — z) log — x) 
=pPIl=j,;l—z) +s, 
for x,y,z € Io, where C; are arbitrary constants and L satisfies 
LUi-xy,l—uv) =LO—-x,1l-—u)+LU-—y,1—vd) 
and f is a solution of 


rd-s) td—-w) 


dlrst) + (5,0) —d ( CE) —airs,t0) 
l-rs 1—tw 


-a(4 —r) w—t) 


> 1—tw 


)+co=0. 


l-rs 
forr,s,t, w € Ig, which is a generalization of (11.10). 
Remark. It is easy to check that 


2 


7 y? 2 
P(x, ¥,z) = alog 5 PEO 7 Ole 7 


—x -y 


is a solution of (11.26) on i 
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Also 


3 
F(x1,x2,.%3) = > aj log i 
i=! 


2 2 


x2 
— = G(x1, x2, x3), 
=a 


x: 1- 2 
H (x1, x2, x3) = > a; log — se dela BiG ceases, 
i=1 1 xj X3 
Z x? x3 
= : i 
K (x1, X2,.x3) = > a; log =< + 2a3 log C— 


i=l 
is a solution of (11.27) on Ee 


Problem. Determine the general solution of (11.26) and the regular solution of 
(11.27) on Jp. 


12 


Regularity Conditions— Christensen Measurability 


Regularity conditions and Christensen measurability and its applications are treated 
in this chapter. 

Conditions such as boundedness, monotonicity, measurability, continuity at a 
point, continuity, the Baire property, integrability, differentiability, and analyticity, 
for example, are called regularity conditions. To solve functional equations, it was 
customary to assume a rich regularity property like differentiability and reduce a 
functional equation to a differential equation and solve it. The trend for quite some 
time has been to solve functional equations under weaker regularity conditions like 
integrability or measurability or no regularity condition at all (solve algebraically). 

In connection with his fifth problem, Hilbert [372] commented that although 
the method of reduction to a differential equation from differentiability makes it 
easier to solve functional equations, the question is, is the differentiability assump- 
tion necessary? And how far are the assertions that we can make in the case of dif- 
ferentiable functions true under proper modifications without this assumption? (See 
Aczél [33] and Chapter 11.) This second part of the problem draws attention to what 
are now known as regularity problems. The main idea used in these investigations is 
to assume weak regularity properties of the unknown function(s), say monotonicity, 
measurability, continuity, etc., and to obtain strong regularity conditions, for exam- 
ple integrability, continuity, differentiability, or analyticity. Results of this kind are 
called regularity theorems for functional equations (Jarai [390]). First we illustrate 
these with specific examples, mostly from Cauchy equations. Then we present some 
general theory. This chapter ends with Christensen measurability. 


Illustrative Examples 
Differentiability 
Let f:R-R. 
(1) Suppose f is differentiable and satisfies (A). Differentiate with respect to y to 
et 
; S@+y)= fy) 
so that f’(x) =c = f’(0) and f(x) = cx +d. Since f(0)=0, f(x) =cx. 
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(ii) 


(iii) 


(iv) 


(v) 


(vi) 
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Let f be a nonconstant differentiable solution of (E). Differentiate with re- 
spect to y to have 


f@+y) = f@S'O) 


so that f’(x) = cf(x), c= f’(0). Ifc = 0, then f’(x) = 0 or f(x) = con- 
stant, which is not the case. Then we see that f has derivatives of all orders 
and f(x) = c" f (x). Note that f is never zero and f (x) = e®. 

Suppose f : Ri. — R is a nonzero differentiable solution of (L). As before, 
differentiating with respect to y, we get 


xf'(y) = f'(y) 
or 
xf'(x)=c= f’(). 
c = 0 would imply f = constant. Then f has derivatives of all orders and 
f(x) =clogx. 


Let f : Ri — R be a nonconstant differentiable solution of (M). Again 
differentiating with respect to y, we obtain 


xf' (xy) = f@)F'(); 
that is, 


xf'(x) =cf(x), c= f'(1). 


c = 0 would imply f(x) = constant. Then f has derivatives of all orders and 
f (x) = x* (note that f(x) 4 0 for x 4 0). 


Let f : R — R be an integrable solution of (A). Integrating (A) with respect 
to x in an interval [c,d], we have 
d d d+y 
@-of0)= f fet ydx- fo peoax= f pwau ter 
c c ct+y 


Thus f is continuous and then f is differentiable. Use (i) (see Hille [374], 
Aczél [12], etc.). 

Suppose f : R — Ris a nonzero integrable solution of (E). Note that f is 
never zero and positive. Integrating with respect to x over [c, d], we get 


d d 
/ ee ores / Fle)dx: 


that is, 


d+u 
Ose i f@ddu. 


+u 


Hence, as before, f is differentiable, and use (11) (see also Dhombres [218]). 
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(vii) [44]. The general solution bounded, from one side, on a set of positive measure 
f :R* = R of (L)is given by f(x) = clog |x|, x € R*. 

(viii) [44]. The general solution f of (E) on R in the class of functions bounded from 
above on an interval or on a set of positive measure is given by f(x) = e™ for 
x ER, (R4). 

(ix) [44]. Nonzero f : Ri — R continuous at a point or bounded on a set of 
positive measure satisfying (M) is given by f(x) = x° forx > 0. 

(x) For similar results about (A), see Theorems 1.1 and 1.2. 

(xi) Every f : [1, co[ > R of (L) bounded on [1, a] is clog x. 

(xii) Suppose f on R is a measurable solution of (A). Then f is continuous (see 
Theorem 1.1, [44, 426, 218]). Every measurable solution f on R of (E) is 
continuous (Hewitt and Ross [369]). All measurable solutions f of (C) on R 
are continuous (see [424, 44], Remark 3.19). 

We encountered many functional equations in information theory in 
Chapter 10. Every solution of (FEI) that is measurable or has the Baire pro- 
perty is continuous and has derivatives of all orders. Also, integrable solutions 
of (FEI) are continuous and infinitely differentiable. Measurable solutions 
fi @ = 1 to 4) :]0, 1[ > R that satisfy 


— ) = A040 =f (=) 
—x l-y 


for x, y € JO, 1[ with x + y € ]0, I[ are continuous and infinitely often differ- 
entiable (Ebanks, Sahoo, and Sander [245]). 
Another equation one comes across is the sum form functional equation 


erase: ( 


n m m 


YD fics) = Deer + Dom 0), 


i=1 j=1 


(i) =xETh, y=) Tne 
Let fij,gi,hj :J0,1[ > R@ = 1,2, 7 = 1,2) satisfy the functional 
equation above. If fj;,;, g; are measurable, they are continuous and differ- 
entiable (see [245, 218, 452], Chapter 11). 
If fi; = gi =h; = f is measurable, then f is continuous [448, 198, 44]. 
(xiii) Let F : R* > C be a nonconstant solution of the equation 


F(x, y)F(u,v) = F(xu — yo,xv + yut+ yo) 


measurable on a set of positive Lebesgue measure. 
Then f is continuous and given by 


(x + jyer sete) for (x,y) # (0,0), 


F(x,y)= 
(*.¥) f forx =O=y, 
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(xiv) 


(xv) 
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where j = Li iBir:R2? > R, then 


(x?+y?+xy)? for (x, y) 4 (0,0), 
Fa,y)= 

0 forx =O=y 
(see [313, 37]). 
In Chapter 7, we introduced the difference operator Aj, 


A} f (&) = Anf (x) = f@ +h) — fF), 
Af) => (p+ (I) ro + kh). 
k=0 


If f : R — R is a measurable function such that Le f(x) = 0 almost 
everywhere (a.e.) on R forn € Zi and h € H a suitable subset of IR, then 
f is equal a.e. on R to a polynomial of degree at most n. In particular, every 
measurable polynomial function f : R > Ris a polynomial. 

It is not that all measurable solutions of a functional equation are continuous; 
continuous solutions are differentiable, bounded above or positive solutions 
are continuous, etc. 

Let f : [0, oo[ > R such that f(0) = 1, f(x) = 0, for x 4 0. Then f 
satisfies (E) and is measurable but not continuous; f (0) = 0 and f(x) = 1| for 
x #0, and x € Risa measurable solution of (M) but not continuous. 

Define f : R > R by f(x) = |x|. Then f satisfies (M) and is continuous 
but not differentiable. 

The Aczél-Benz functional equation [38] 


f+ foO)=f@a)+faty—f@)), where f: RR, 


has the continuous solution 


FO) = 5 £ Ie), 


which is not differentiable. 


Let X be a linear space over Q endowed with a topology such that X is a sepa- 
rable topological space with the mapping (A,x,z) — Ax + z continuous in each 
variable for 2 € QO, x,z EX. 

Let Y be a linear space over Q that is endowed with a topology such that Y is a 
separable and metrizable topological space with the mapping (u, y, w) > wy +w 
jointly continuous with respect to the triple (u, y,w)€ Ox Y x Y. 

Let M be a o-algebra of subsets of X. A function f : X — Y is M-measurable 
if and only if f—'(V) € M for every open set V C Y. 

Let N be a proper o-ideal of M. For c € M\N, c, © X dense in X with 
ctcEeM, X\(c +a) EN. 

Let H be a countable, dense subsemigroup of X. 
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Result 12.0. (Kuczma [549]). Suppose K is a countable, dense vector subspace of 
X over Q with the following property. 

Let H €N be a subgroup of X such that Ho = HM K is dense in X; AP +z€ 
N, AR+z2¢€Mforeveryi € QO, z€ X, PEN, REM; and M contains all 
Borel subsets of X. If an M-measurable function f : X — Y satisfies the condition 


aman 163) =0 forallxe X,heH, 


then f is a continuous polynomial function of order n. 

Ifan M-measurable function f : X — Y satisfies for every h € H the condition 
NEO = 0, N~ (a.e.), then there exists a continuous polynomial function @ : 
X = Y of ordern such that f (x) = $(x), N— (ae.). 


Result 12.0a. Every Lebesgue measurable or Baire measurable polynomial f : 
R — R (ef order n) is continuous and hence is a real polynomial (of degree at 
most Nn). 


12.1 Some General Results 


The following results show that, under quite general circumstances, measurability of 
a solution implies continuity [390]. 


Baire Category 


Definitions. A subset F of a topological space X is of first category if F can be 
represented as a countable union of nowhere dense sets (i.e., sets having closure 
with empty interior); otherwise F is said to be of second category. 

A topological space X is called a Baire space if the intersection of any countable 
class of dense open sets is dense. The theorem of Baire states that locally compact 
spaces and complete metric spaces are Baire spaces. 

Let F bea subset of the topological space X. We say that F is of second category 
ata point x € X if F M V is of second category for every neighbourhood V of x. 
Let D(F) denote the set of all points x € X for which F is of second category at x. 
Then D(F) = @ if and only if F is of first category. Moreover, D(E) is closed and 
the set F\\D(F) is of first category. 

We will say that F C X has the Baire property if there exists an open set V 
such that the symmetric difference FAV is of first category. All subsets of X having 
Baire property form a o-algebra. Of course this o-algebra contains Borel sets, the 
members of the smallest o-algebra containing all open sets. A set F C X has the 
Baire property if and only if F\D(F) is of first category. The set F has the Baire 
property if and only if each point x of X has an open neighbourhood U such that 
U 1 F has the Baire property in X. 


Baire Property of Functions. The function f has the Baire property on E if the 
domain of f contains F except for a set of first category, the range of f is ina 
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topological space Y, and F  f~!(V) has the Baire property in X for every open 
subset V of Y. We simply say that f has the Baire property if it has the Baire property 
on X. 

This definition is very similar to the definition of a Borel function. A function 
jf mapping some subset of a topological space X into another topological space Y 
is called a Borel function if for each open subset V of Y the set f—!(V) is a Borel 
subset of X. 

The properties of functions having the Baire property are very similar to the 
properties of measurable functions. 


Result 12.1. (Jarai [390]). Let X be a completely regular space and X09 a o -compact 
completely regular space, and let X; (i = 1,2,...,n) be completely regular spaces 
with countable bases. Let Y; (i = 1,2,...,n) and Y be locally compact spaces 
and T be an arbitrary topological space. Let v be a Radon measure on Y and let 
Mi G@ = 1,2,...,n) be Radon measures on Y;. Suppose that v(Y) < oo, mi(Yi) < 
oo i = 1,2,...,), and let C be a subset of T x Y. Consider the functions f : T > 
X, fo: Y > Xo, fi: Yi m Xi, h: C xX XQ X XX +++ KX Xp > X, Bp CO 
Y; @ = 1,2,...,n). Let to be a fixed element of T, and suppose that the following 
conditions hold. 
For each (t, y) € C, 


(12.1) f@Q)=htt,y, fo), filgi@, y)),.--. frlgn(t, y))); 
h is continuous; the function fj is uj-measurable on Y; (i = 1,2,...,n); gi is 
continuous on C (i = 1,2,...,); C; is measurable and there exists ann > 0 such 


that v(C; 1 Cy) = yn for eacht € T; and for each ¢ > 0 there exists a 6 > 0 for 
which wi (giz(F)) = 6 whenever F C C;, v(F) > e, t € T, and1 <i <n. Then 
f is continuous at the point to. 


Result 12.2. [390]. Let X be a completely regular space, Xo a o-compact com- 
pletely regular space, X; (i = 1,2,...,n) completely regular spaces having count- 
able bases, and T an arbitrary topological space. Let Y be an open subset of R*, Y; 
an open subset of R" (i = 1,2,...,n), and C an open subset of T x Y. Consider the 
functions f :T > X, fo: Y > Xo, 91: C2 Yi, fi: Yi m Xi G@ = 1,2,...,n), 
andh:C x Xo xX X1 X +++ X Xp > X. Suppose that to € T and the following con- 
ditions are satisfied. 

For each (t, y) € C, (12.1) holds; h is continuous; f; is A measurable on 
the subset C, of Y; G@ = 1,2,...,n); gi is continuous on C (Gi = 1,2,...,n); 


n 
ak ( U ee (C;) } > 0; and the partial derivative cB is continuous and has rank r; 
i=l” * 
for each (t,y)€C G@ = 1,2,...,7). 
Then f is continuous on a neighbourhood of to. 


Result 12.3. [390]. Let T, Y, Xj, ¥; G@ = 1,2,...,n) be topological spaces, Xo a 
o-compact space, and X a completely regular space. LetC CT x Y, Ci C Xi G@ = 
1,2,...,), and to € T, and consider the functions f : T > Xn, fo: Y > Xo, 
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gi: Co Xi, fi: Xi 2 Yi @=1,2,...,n), h: C x Xo x Xx: x Xn > X. 
Suppose that the following conditions hold. 

For each (t, y) € C, we have (12.1); h is continuous; f; has the Lusin-Baire 
property on the subset C; of Xj i =1,2,...,n); gj is continuous (i = 1,2,...,n); 
there exist sets V and K such that V x K CC, V is open, to € V, K is of second 
category and has the Baire property, and 


n 
=1 
K Cf) 8ig(Ci; 
ol 
and if F is a second category subset of Y and F C K, then git(F) is a second 
category subset of X; whenever 1 <i <nandt eV. 
Then f is continuous on a neighbourhood of to. 


Problem 12.4. [29]. Let T and X be open subsets of R? and R”, respectively, and 
let C be an open subset of T x T. Let f:T > X, g3: C>T G@ =1,2,...,n), 
andh:C x X"*+! —. X be functions. Suppose that 


(12.2) f®Q=ht.y, fO), flgil, y)), ---5 (gn (t, y))) 


whenever (tf, y) € C; A is analytic; g; is analytic; and for each t € T there exists a y 
for which (t, y) € C and ee, y)hasranks @ =1,2,...,n). 
Is it true that every f that is measurable or has the Baire property is analytic? 


The complete answer to this problem is unknown [390]. 


Result 12.5. [390]. Jn Problem 12.4, ifh is continuous and the functions gj are con- 
tinuously differentiable, then every Lebesgue measurable solution f is continuous. 


Proof. This is a consequence of Result 12.2 if we apply it locally. 


Result 12.6. [390]. Jn Problem 12.4, if the functions h and g; are € times continu- 
ously differentiable, then every locally Lipschitz solution f is € times continuously 
differentiable (1 < € < o). 


Result 12.7. [390]. In Problem 12.4, if the functions h and g; are max{2, €} times 
continuously differentiable and there exists a compact subset C of T such that for 
each t € T there exists a y € T satisfying gi(t,y) € C, then every Lebesgue 
measurable solution f is € times continuously differentiable (1 < € < 00). 


Proof. It follows from Results 12.5 and 12.6. 


Result 12.8. [390]. Jn Problem 12.4, ifm = 1 andn = | and the function 
h is p times and the function g, is max{2, p} times continuously differentiable, 
then every Lebesgue measurable solution f is p times continuously differentiable 
(l< p<). 


Proof. The statement is a consequence of Results 12.5 and 12.6. 
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Result 12.9. [390]. Jn Problem 12.4, if equation (12.2) has the special form 


f= Dh y, fei@ y))), 


i=1 


where the functions hj : C x X — R" are p times and the functions g; are max{2, p} 
times continuously differentiable, then every Lebesgue measurable solution f is p 
times continuously differentiable (1 < p < ©). 


12.2 Applications 


A fairly typical situation is that measurability implies continuity. 


1. Let f : RR — R bea solution of (A). With substitution t = x + y, (A) becomes 
f(t) = f(y) + f(t — y). If f is measurable on a set of positive measure, f is 
continuous by Result 12.2, and then from Result 12.9 it follows that f is infinitely 
differentiable. 

2. Let f : RR” > R, h: R"+? = R satisfy 


fa@ty)=h@, y, f%), fO)), xy ER". 


With substitution t = x + y, the equation becomes 


f@=ht—-y,y,f@—y), fO)), 


where / is given and f is unknown. 
Every solution f that is measurable on a set of positive measure is continuous 
by Result 12.2 and, from Result 12.8, has infinite differentiability of /. 
3. The Pexider equation and its generalization. 


Problem 12.10. (Smajdor [756]). Let f,g, and h be real functions and H : 
R* — R acontinuous function, and suppose that 


(12.3) f(x +y) = H(g(x), hA(y)) 


whenever x, y € R. Are the following statements true? 

e Ifgandh have the Baire property, then f is continuous. 

e If gandh are Lebesgue measurable on a set of positive measure, then f is 
continuous. 

e If g orh is Lebesgue measurable, then f is continuous. 


By applying the results above, the assertions above are true [390]. 

4. Now we consider a result in [386]. Let S be a locally compact space on which is 
defined a positive Radon measure yu satisfying certain mild restrictions. There is a 
composition law (s,t) + sot defined for (s, tf) € C C Sx S, and this mapping is 
continuous in C. Let S+, Sz, Sy be three Hausdorff spaces and let G be a mapping 
of Sz x S; into Sf that is continuous on every compact set K C Sg x Sf. 
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Let f, g,h be three mappings of S into S¢, Sz, Sy, respectively. Suppose g 
and are uw-measurable and that 


(12.3a) f(sot)=G[g(s), h(s)] for (s,t) €C. 


Then f is continuous on S. This theorem applies to a large number of functional 
equations commonly considered (see [374]). 

We saw that any continuous solution of (A) is also differentiable, and it obvi- 
ously admits derivatives of all orders. This does not imply, however, that in the 
complex plane a continuous solution of (A) is necessarily analytic. Indeed, 


f(x + iy) = ax + by 


is a continuous solution, and this is analytic if, and only if, b = ia. The same 
phenomenon holds for other equations of the type 


f@1 022) = G(f(z1), f@2)), 21,22 € C. 


On the other hand, if G(u, v) is a symmetric analytic function of u,v and if the 
rule of composition is symmetric and analytic, then analytic solutions may be 


expected. 
Let us add some remarks pertaining to the case 
(12.3b) fat+y)=GF@), fO)I 


with a symmetric function G holomorphic in some domain C = Co x Co in C?. 
Let a be a root of the equation 


G(a,a)=a, aéCo. 


We exclude the case in which this equation is an identity or has no solutions 
in Co. Equation (12.3b) has constant solutions, f(x) = a, where a is any root 
of G(a,a) = a. Suppose now that f(x) is a solution of (12.3b) defined and 
continuous in some interval [0, w] with f(0) = a, f(x) ¥ a. Suppose that 
f (x) € Co and that 

Gi (a,a) =1. 


Then it may be shown that f(x) has derivatives of all orders in [0, w] and the 
higher right-hand derivatives at x = O are uniquely determined by f(O) and 
f'(O). This follows from results in [234], where the theorem is proved assum- 
ing that f(x) has values in a Banach algebra. It follows that if f and g are two 
continuous solutions of (12.3b) and if f(0) = g(0), f’(O) = g’(0), then f and 
g are identical in [0, w]. This does not exclude the possibility that these solutions 
may be extended to the complex plane in two different ways, both extensions 
satisfying (12.3b). At most, one of these extensions can be analytic. 

The case where G is algebraic and x o y = x + y has been examined in great 
detail. Here the analytic solutions have been known since Weierstrass. Contin- 
uous solutions of a real variable have been studied by many authors. Here it is 
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10. 


found that any continuous solution is piecewise analytic. If f(x) is a continuous 
solution, then there is an analytic function g(z) that is algebraic in z or e“* or 
C (bz) and constants a; such that in the kth interval f(x) = g(x + ax). 


. The cosine equation. 


Let G be a locally compact group, H be a topological ring with a countable base, 
and f : G — H bea solution of (C). With the substitution t = ay" (C) is 
f(t) = 2f (ty) f(y) — f (ty). When G is a locally Euclidean group (i.e., a Lie 
group), every Lusin measurable solution of (C) or solutions with the Lusin-Baire 
property are continuous [390]. 


. The Pexider equation (PA), Jensen equation (J), Hosszu equation, 


(12.4) f@+y—xy)+ fay) = fa)+ fO), x,yeER, 
(10.39c) f(y)+e@d—y))+h(A-—x)d —y))+k(1 —x)y) = 0, 


etc., are of the form (12.2). Measurability of f implies differentiability of order 
oo (see also [756]). 


. Abel’s or dilogarithm equation. 


Locally integrable or measurable solutions or those having the Baire property of 
Theorem 11.7 are continuous and have derivatives of all orders (use Result 12.9) 
((233, 33, 440, 460]). 


. Equation from the spectral theory of random fields. 


Measurable solutions of the equation 


f@)A + fa) = f@FA+a)+ f—a)), x,aER, 


are continuous and hence locally bounded [390]. 


. Equation of the duplication of the cube [71, 390]. 


Let J be an open interval of positive length in R and f : J — R* be measurable 
on a subset of J with positive measure and satisfying 


fa)f(px + — p)y)+ fy) f(A — p)x + py) 
= f(px + (1 — py)? + f((L — p)x + py)’. 


Then f is continuous. 

Translation equation (Zdun [840]). 

Let X be a compact metric space f : ]0,oo[ xX — X, and suppose that f 
satisfies the translation functional equation 


(12.5) futo,x) = flu, f(v,x)), u,v €]0, oof, x € X, 


and f is measurable with respect to the first variable and continuous with respect 
to the second one. Then / is continuous. 

Guzik extended this result proving a “measurability implies continuity” type 
result for the more general equation 


futo,x)= Shia, x), filu, bi(v,x))), u,v €)0, oof, x € X. 
i=l 
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12.3 Christensen Measurability 


Background 


It is well known that any Lebesgue measurable solution f : R — R satisfying 
Cauchy functional equations (A) and (E) has to be continuous. This has been proved 
by many authors (see Chapter 1). This type of result (“measurability implies conti- 
nuity”’) is very typical; indeed it occurs with respect to many well-known and impor- 
tant functional equations (e.g., the functional equation characterizing polynomials 
(Chapter 7), trigonometric functional equations (Chapter 3), and Jensen’s equation 
(Chapter 1)). In the majority of cases, the property “measurability implies continu- 
ity” is shared by solutions defined on a locally compact Abelian group; in that case, 
measurability is understood in the sense of a completed Haar measure. However, in 
many instances, local compactness turns out to be a pretty restrictive requirement; 
in particular, it eliminates all infinite-dimensional Banach spaces. However, in the 
early 1970s, it was shown in Christensen [159] that the concept of a zero set in the 
sense of a Haar measure can be generalized to the case of any Abelian Polish topo- 
logical group. It turns out that in many respects these groups behave like locally 
compact groups, and various basic theorems (suitably modified) from abstract har- 
monic analysis carry over to this case (see [173]). 

We now give some notation and terminology. Let X (+-) denote an Abelian Polish 
group and let D(X) be the o-field of all Borel subsets of X. By M,(X) we denote 
the completion of 6(X) with respect to a given measure uw : B(X) — [0, co]. The 
members of the o-field 


M(X) = O{M,(X) : «is a probability measure on B(X)} 


are called universally measurable sets. Put Ho(X) := {C € M(X) : for some 
probability measure uw on M(X) one has u(C + x) = 0 forall x € X}; any element 
C of Ho(X) is called a Haar zero set in X, and a corresponding probability measure 
is named the testing measure for C. 

Let Co(X) := {B C X : there exists an A € Ho(X) such that B Cc A} and 
C(X) := {CUB:C € M(X) and B € Co(X)} [279]. 

Any set C € Co(X) is called a Christensen zero set, whereas the phrase “C C X 
is Christensen measurable” will stand for “C € C(X)”. Therefore, any Christensen 
measurable set is a union of a universally measurable set and a Christensen zero set; 
the latter is simply any subset of a Haar zero set. 

A mapping f from X into a topological space Y is said to be Christensen mea- 
surable provided that f~!(U) € C(X) for any open set U C Y. 

Needless to say, the notion of a Christensen measurable mapping reduces to 
the classical Haar measurability in the case where the topological group considered 
happens to be locally compact. 

A presentation of several new results in the spirit “Christensen measurability 
implies continuity” valid for solutions to various classical equations and inequalities 
is presented in [308]. 
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Christensen Zero Sets and the Ideals 


Let G(+) be a group (not necessarily commutative). A nonempty family J C 
2°\{G} is called a proper linearly invariant set ideal (respectively o -ideal) if, and 
only if, J is closed under finite (respectively o-finite) set-theoretic unions, heredi- 
tary with respect to descending inclusions, and such that x — C € T provided that 
xeGandC eT. 

The family of all first category subsets of a second category topological group 
and the family of all sets with zero Haar measure is an Abelian locally compact 
group yielding the pattern examples of proper linearly invariant (abbreviated to p.1.1. 
in what follows) o-ideals (see, for example, Dhombres and Ger [221] and Kuczma 
[558]). 

For T ap.l.i. ideal (o-ideal) in a group G, define 


x(T):={N C G?:N C(G x U)U(U x G) for some U € T} 


and 
Q(T) := {N C G*: M, €T for T — almost all x € G}. 


Result 12.11. (Ger [308]). In any Abelian Polish group, the family of all Christensen 
zero sets forms a p.l.i. o -ideal. 


Christensen Measurability Implies Continuity 


Now, we may ask whether Christensen measurability provides the results in the spirit 
described above, i.e., measurability implies continuity. Actually, it does. This may 
also be considered a good argument that the notions presented above are well moti- 
vated. 


Result 12.12. [279]. Any Christensen measurable homomorphism from an Abelian 
Polish group into another one is continuous. 


Result 12.13. [279]. Let X be a real linear Polish space and let f : X > Rbea 
Jensen-convex functional; i.e., 


x +) _f@+F0) 


5 < 5 forallx,yé€X. 


(JC) f ( 


If f is Christensen measurable, then it is continuous (and hence also convex). 


Result 12.14. [297]. Let C be a nonempty open and convex subset of a real linear 
Polish space X (+; -) and let S C X be acone such that SU(—S)U{0} = X. Finally, 
let Y (|| - ||) be a Banach lattice with partial order < . If f : C — Y is a Christensen 
measurable mapping such that for some positive integer n one has 


n+l 


Sept (" ‘re + jh)>0 


j=0 


forallx € C andh € S for which x + (n+ 1)h € C, then f is continuous. 
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Result 12.15. [294]. Zf Y(|| - ||) is a separable (real or complex) Banach space and 
H:Y xY — Y is such that 


| (s1,¢) — H(s2, tll > a(llsi — s2lw@), s1,582,t € Y, 


where w : Y — (0,0) and a : [0,00) — [0, 00) are certain functions with 
o(0) = 0, o(t) > Ofort € (0,00), @ increasing, and if X is an Abelian Polish 
group, then any Christensen measurable solution to (12.4) is continuous. If, more- 
over, the composition w o f is bounded away from zero, then f is uniformly contin- 
uous. 


12.4 Functional Equations (Characterizing) from Trigonometric 
Functions 


Theorem 12.16. (Ger [308]). Let X be an Abelian Polish group. Then any 
Christensen measurable solution f : X — C to the d’Alembert equation (C) is 
continuous. 


Proof. From [424, 419], it follows that either f = 0 or f is of the form 


1 1 
(12.6) fa=5 (10 fe =) , 22%, 


where h : X — C\{0} is an exponential function (E). 
From the proof in [424], two cases arise: 


a) f(x) Cc {-1, 1}. Thenh := f. 
(ii) f(x0)* € 1 for some xo € X. Then 


A(x) := f(x) + [f@ + xo) — f@)FGo)], xEXx 


1 
V f (xo)? - 1 
(no matter which of the two roots is fixed). Therefore, if f is Christensen measurable, 
then so is h. Applying Result 12.12, we infer that 4 is continuous. Consequently, in 
view of (12.6), f is continuous, which was to be proved. 


Result 12.17. [308]. [f X is an Abelian connected Polish group and f : X > Risa 
Christensen measurable and bounded nonzero solution of (C), then f is a real part 
of a continuous character of the group X. 


Proof. We have f (0) = Oor f (0) = 1. The first possibility is excluded since it forces 
f to be zero, which contradicts the assumption. Therefore, f(0) = 1. From [424], f 
has the form (12.6), where is a homomorphism of X. By virtue of Theorem 12.16, 
f is continuous, and no matter which of the two cases (i) or (ii) from the proof 
occurs, this forces h to be continuous. Since f is supposed to be real, we get 


flay = 5 (hy + A) = 5 (Ta + —) gex 
~ 2 h(x) h(x) , 
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Obviously, h is a homomorphism as well; hence f admits two representations of the 
form (12.6). A repeated appeal to [424] yields 


(iii) h=h 
or 

~\Z_ 1 
(iv) h=j;. 


Assume (iil). Then / is real and, since 


1 1 
1=fO=5 (H+ =): 
one has h(0) = 1 > 0. Now, the continuity of / and the connectedness of X imply 
that h(x) > 0 for all x € X. Consequently, f = 1 and the assertion holds true in a 
trivial manner. 
Assume (iv). Then |h| = 1; i.e., 4 is a continuous character of the group X. 
Moreover, 


1 1 1 — 
f= 5 (: + *) = 5h +h) = Reh, 


and the proof is complete. 


Remark 12.17a. The assumption of the connectedness of X may be replaced by 
the requirement of 2-divisibility of X. In fact, the only place we have used the 
assumption that X is connected was to show that / is positive (in case (iii)). If X is 
2-divisible, then the positivity of h may be derived as follows: 


l ry 
h(x) =h{2-=x)=hl =x} >0, xeEXx 
2 2 


(5x need not be unique). 


Remark 12.17b. In [652], the result above was proved under the additional strong 
assumptions that (a) f is continuous and (b) X is locally compact (see Chapter 3). 


Let us now consider the sine functional equation (S). The behaviour of its 
solutions is somewhat more troublesome than that of solutions of (C). In particu- 
lar, additive functions are solutions to (S) (see Chapter 3, Theorem 3.43). Moreover, 
some further technical difficulties occur in the case where the domain of the function 
considered is not 2-divisible. 


Result 12.18. [308]. Let X be a uniquely 2-divisible Abelian Polish group. Then 
any Christensen measurable solution f : X — C of (S) is continuous. 


Proof. Any solution f : X — C of (S) is either additive (i.e., satisfies Cauchy’s 
equation (A)) or of the form 


(12.7) fey =e(my-), xeXx, 


where h : X — C* is an exponential function (E). 
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In the first case, it suffices to apply Result 12.12. If the latter case occurs, then 
additionally, for an a € X such that f(a) 4 0, the function 


(12.8) g(x) = ——— TH ) [fa~+a)—fa-a)], xeEXx, 
has the form 
(12.9) g(x) = ; 5 (Hex) + a). xeXx. 


Therefore, g yields a Christensen measurable solution to equation (C), whence, in 
view of Theorem 12.16, g is continuous. This forces h to be continuous (see the 
proof of Theorem 12.16). An appeal to formula (12.7) completes the proof. 


Result 12.19. [308]. Jf X is a uniquely 2-divisible Abelian Polish group and 
f : X — Risa Christensen measurable and bounded solution of (S), then f is 
proportional to the imaginary part of a continuous character of the group X. 


Proof. Let f : X — R be a Christensen measurable and bounded solution of (S). 
Without loss of generality, we may assume that f # O (the zero function is the 
imaginary part of the trivial character h(x) = 1, x € X). By virtue of Result 12.18, 
f is continuous. Moreover, f is nonadditive; in fact, otherwise, being bounded, 
f would have to be zero, contrary to the assumption. Consequently, the function 
g defined by (12.8) is continuous and has the form (12.9) with some continuous 
exponential function h : X — C*, whereas f is given by (12.9). An appeal to [424] 
allows one to state that for any other exponential function k : X — C* such that 


g(x) = : (kos os ). xe x, 


one has ; 
k=h or k=-. 
h 


Thus, if 
fix)=d (koa = =). rex, 


for some exponential function k, then 
1 
(k=handd=c) or k= 7 add =—e ‘ 
h 


Since f is real, we obtain the equality 


—— 1 
f(x) =e (ix) - iG =) = (i - a) ; 
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valid for all x € X. Thus 
= — 1 
(h=handc=c) or (i =F andc = -c) ; 
h 


In the first case, both A and c are real. In view of the 2-divisibility of X, we infer that 
h is strictly positive (cf. Remark 12.17a). Since f is bounded, so is g, on account of 
(12.8). Proceeding as in the proof of Result 12.18, we get h = 1, which leads to a 
contradiction (f = 0). Therefore, the other possibility must occur; Le., 


This means that / is a (continuous) character of the group X and c = fi for some 
B € R. Now, 


f(x) = ip(h(x) —h@)) = —2BImA(x), x € X, 


which was to be proved. 


Remark. In the proofs of Theorem 12.16 to Result 12.19, the Christensen mea- 
surability assumption has been used indirectly by appealing to Result 12.12. Thus, 
for any kind of measurability forcing the continuity of group homomorphism, these 
results remain valid as well. 


Christensen Almost Additivity 


P. Erdés [260] asked whether a function f : R — R fulfilling (A) almost everywhere 
with respect to Lebesgue plane measure has to coincide with a true additive function 
almost everywhere in the sense of Lebesgue linear measure. An affirmative answer 
to that question was given in [406] and [136]. This question was next raised and 
answered in connection with various other functional equations and inequalities (see 
[558, 221, 304]). All these results are established in terms of abstract p.1.i. ideals 
having the “Fubini property”. In particular, the following two results are special 
cases of the results obtained in [136] and [331]. 


Result 12.20. [308]. Let X be an Abelian Polish group and let Y be any group (not 
necessarily commutative). If f : X — Y is amapping such that 


fx +y)=f@)+ f(y) for Q(Co(X)) —almost all (x, y) € X?, 
then there exists exactly one homomorphism a: X — Y such that 
f(x) =a(x) forCo(X) —almostallx € X. 


Result 12.21. [308]. Let X and Y be as above. If f : X — Y is a mapping such 
that 


faty=f@)+fQ) fora(Co(X) —almost all (x, y) € x. 


then f is a homomorphism; in other words, if relation (A) holds for all x,y € X*, 
where U € Co(X), then (A) is satisfied unconditionally in X?. 
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Similar results may also be obtained for polynomial functions, d’ Alembert’s and 
sine functional equations, functional equations of Mikusinski, the Pexider equation, 
and Jensen convex functions. 
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Difference Equations 


In this chapter, we deal with several difference equations arising from the well- 
known Cauchy, Pexider, quadratic, cosine, and other functional equations. Cauchy 
difference, difference that depends on the product, the Pompeiu functional equation 
and its generalizations, quadratic difference, and Pexider difference are treated. 


13.1 Cauchy Difference 


Let f : G — H, where G is an Abelian group and H is a divisible Abelian group. 
Then 


(CD) fa@+y)—f@)—f), forx,yEeG, 
is called the Cauchy difference. It is easy to verify that 
(13.1) d(x, y) = fx +y)— f(x) — fo), forx,y € G, 


satisfies the so-called cocycle equation 


(13.2) d(x, y)+Oat+y,2)=O@,y+z2)+00,2), forx,y,z €G, 


which has been studied by many authors, including G. Galbura, Th. Anghelutza, 
M. Ghermanescu, I. Stamate [764], M. Hosszi [380], T.M.K. Davison [209], 
M. Kurepa [243], J. Erdés [259], and J.D. Aczél [395, 509]. The equation (13.2) 
has occurred in different fields, including homological algebra, the Dehan theory 
of polyhedra, statistics (Daykin and Eliezer [216]), and information theory. It was 
proved in Kannappan and Sahoo [509] that all differentiable solutions of (13.2) are 
of the form (13.1). 
Put y = 0 in (13.2) to get 


(13.3) o(x,0) = d(0,z) =constant=c forx,z€EG. 
Partially differentiate (13.2) with respect to z. Then 


g2o(x+y,z) = do, y +z) + h2(y, z). 


Pl. Kannappan, Functional Equations and Inequalities with Applications, 511 
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Set z = 0 to get fdo(x + y,0) = do(x, y) + d2(y, 0), which after integrating with 
respect to y yields 

px, y)=s@t+y)— sly) +A). 
Letting y = 0 in this equation, we have 

p(x, 0) =c = g(x) — g(0) +h), 


that is, 


o(x,y)=g@+y)—agly)— g(x) +c4+ g(0) 
=faty)—fQ)— f@), 


where f(x) = g(x) —c — g(0), which is (13.1). 
Result 13.1. [259]. A solution of (13.2) is of the form (13.1) if and only if ¢ is a 


symmetric function, 1.e., 


(13.1a) (x, y)=¢0,x) forx,yéEG. 


Based on this, the following result is proved that at the same time weakens the 
condition of differentiability of ¢ to continuity. 


Result 13.2. [259]. Any continuous solution ¢ : R x R = R of (13.2) is of the form 
(13.1). 


Proof. The proof presented here is due to J. Aczél. From Result 13.1, it is enough to 
show that ¢ is symmetric when ¢ is continuous. 
Define g: R x R— Rby 


(13.4) g(x,y) =, y)—P(y,x) forx,yeER. 
We will show that g = 0. Obviously 

(13.4a) g(x,y)=—g(y,x) forx,yER. 

Interchanging y and x and y and z separately in (13.2), we have 


Oy. x) +O +%x,z)=O0,x4+2)+ 4,2), 
P(x, 2) +OM+2,y)=OO@,72+y) +, y). 


Adding these two equations and subtracting (13.2), we obtain 
(13.5) g(xt+zy)=8,y) +8. y) forx,y,zER. 


For fixed y, (13.5) is an additive equation in the first variable. Since ¢ is continuous, 
so is g, and by Theorem 1.1 we get 


g(x,y) =c(y)x for some c(y). 
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c(y)x =—c(x)y or  --) forx,y (4 0)ER. 


Thus c(y) = Oand g(x, y) = 0 for y 4 0. But, by (13.4) and (13.3), g(x, y) = 0 for 
y = Oalso. 

Hence g(x, y) = 0 for all x, y € R, resulting in ¢ being symmetric by (13.4). 
This proves the result as stated. 

It is proved in Jessen, Karpf, and Thorup [395] that 6: G x G — H defined by 
(13.1) is characterized by the system of functional equations (13.2) and (13.1a); that 
is, by the cocycle equation and symmetry. The general solution of (13.2) is obtained 
in the following remark (see also Hosszti [380]). This equation plays a central role 
in the theory of functional equations of several variables and has many applications 
(e.g., in the theory of group extensions). 


Remark 13.3. Suppose ¢ : R x R > Ris a solution of (13.2). Then d(x, y) = 
S(x,y) + Bx, y) for x, y € R, where S is a symmetric solution of (13.2) and B 
is a biadditive, antisymmetric (B(y,x) = —B(x, y)) solution of (13.2). Note that 
S(x, y) has the form (13.1). 


Define 
1 
S(x, y) = 5 Oe, y) + d(y,x)), 


1 
BG, y)= 546, y)—$0,%)), forx,yeR. 


It is easy to verify that S and B satisfy (13.2). Now we show that B is additive in 
the first variable. 


2Bixt+y,z3=O@+y,2)-P@, x+y) 

= $(%,y+z)+ 0,2) —$(, y) 
—[@@+x,y)+¢@,x)—¢,y)] by (13.2) 

= $(«,y +2) +4(0,2) -—[$@ +2, y)+4G,x)] 

= $(x,y +z) + O(y,z)-[OG, y +z) +, y)—O@, z)]-$&, x) 
(interchange y and z in (13.2)) 

= $(x, 2) — $@,x) + 0,2) — $@, y) 

= 2B(x,z)+2B(y, z); 


that is, B is additive in the first variable. So, B is biadditive since it is antisymmetric. 
If ¢ is not symmetric, (13.2) may have a solution different from (13.1). 


Remark 13.4. [259]. That (13.1) is not the general solution of (13.2) can be seen in 
the following example. 

Let {h1,h2,h, (v € T index set)} form a Hamel basis of real numbers. Let 
x,y,z € Rwith 
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x =rjhy + sjhz+ Shy: 


vel 
y =roh, + soho + hes 
vel 
z=r3hi t+ s3ho+ Yhes 
vel 


where rj, sj (@ = 1,2,3), ry, r/, and ry’ are rationals; all are zeros but for a finite 
number. 
Define 
o(x, y) =r1s2 — 125]. 


Evidently, ¢@ satisfies (13.1) but is not a symmetric function, so ¢ cannot be of the 
form (13.1). 


Theorem 13.5. (Swiatak [784]). The necessary and sufficient condition for the equa- 
tion 


(13.6a) d(x, y)= f(x +y)—g(x), forx,y €G, 


admits a solution f, g : G — H such that the function ¢: G x G > H fulfills the 
equation 


(13.6) P(x, y+z)+O(,x) =00,x+2)+0,y), x,y,2EG. 
Proof. Evidently, (13.6a) satisfies (13.6). Conversely, let (13.6) hold. Setting y = 0 
in (13.6), we get 
P(x, z) = $(0,x +z) + O(x, 0) — (0, x) 
= f@ +z) —u(x), 
where f(x) = (0, x), u(x) = (0, x) — d(x, 0). This proves the result. 
Theorem 13.6. The necessary and sufficient condition for the equation 
(13.7a) d(x, y)= fx +y)—8@)—-hQ), forx,yeEG, 
to have a solution f, g,h : G — H is that the function ¢ : G x G > H satisfies 
the relation 
P(Z,x +y)—b@, y+z)+46@,y¥)— o(y,z)-¢0,x + y) 

for x,y,z € G. Further, if (f, g,h) and (f\, g1, 1) are solutions of (13.7a), then 

f@) = fi@)+A@)+atb, 
(13.7b) g(x) = 81%) + AG) +4, 

h(x) = h(x) + A(x) +B, 


(13:7) 


where A is additive and a, b are constants. 
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Proof. It is easy to verify that ¢ given by (13.7a) is a solution of (13.7). Conversely, 
suppose (13.7) holds. Then (13.7) can be rewritten as 


(13.8) a(z,x+y)+a,y)=a,y+z)+aQ, 2), 
where 
(13.8a) a(x,y) = (x, y)—¢0,y)—¢(,0) forx,y,zEG. 


Now (13.8a) yields a(x, 0) = —(0, 0) = (0, y) = constant. Then x = 0 in (13.8) 
gives a(z, y) = a(y,z); that is, a is symmetric. Indeed, (13.8) is (13.2). From 
Result 13.1, there is ak : G — H such that 


a(x,y)=kawt+y)—k@) —-kQy), 


which with (13.8a) gives (13.7a). 
If Cf, g, 4) and (f{, g1, 21) are solutions of (13.7a), then 


fe +y)— fi@t+y) = eg) —-g1@)+hly)-M0), 


which is Pexider (PA). Equation (13.7b) now follows. 
When (13.7) holds, the set of solutions of (13.7a) is given by (13.7b). This proves 
the result. 


A generalization of the cocycle equation (13.2) is the equation 


(13.2a1) pi(p,q) + d2(p +4,7r) = b3(p,r) + da(p +1,¢) 


(which arises in the determination of all branding measures of inset information on 
open domains) by reducing it to two other similar equations, 


kK(p,q) + k(p+ qr) =h(p,r) +k(p+r,q) 


and 
fPpat+tgpt+an+fpr+geptrg) =9, 


where ¢;, f, g, h, k : D2 > R, Dn = {(pi,.--, Pn) : pi € J, for all 
i, >, pi € Io}, Io = JO, 1[”, and (p, g,r) € D3. The general solution of (13.2a) 
is given by (Ebanks [237]) 


pi(p, 9) = a1(p) + Big) — a2(p +9) + (p,q), 
f2(p, 4) = a2(p) + b2(q) + 7 (Pp +49) — wD, 49), 
$3(p, 4) = ai(p) + f2(q) — a3(p +g) — w(p. 9), 
pa(P, 4) = a3(p) + Pilg) +7 (p+ 4) + v(p.@), 


where a;,fj,y : J ~ R@ = 1,2,3; j = 1,2) are completely arbitrary and 
y : R” x R” — Ris an arbitrary skew-symmetric and biadditive map. 
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The most general “cocycle” equation is 
(13.9) Fix+y, z)+Fo(ytz,x)+F3t+x, y)+ Fax, y)+Fs(y, z)+ Foz, x) =0 


with six unknown functions F; :G x G > X (i = 1,..., 6). Here G is an arbitrary 
Abelian group and X is a vector space over the rational field. The general solution 
of (13.9) is presented in the following result. 

Result 13.7. (Ebanks and Ng [243]). The general solution F; : G2 > X (i = 
1,..., 6) of (13.9) is given by 


Fi(x, y) = Aix, y) + fax) — (fs + f(y) + fro + y) 


(13.9a) 
= Bie + ¥, 7); 
(13.9b) Fo(x, y) = Bix, y) + fox) — i+ fa)(y) + fir + y) 
= Boe +y,y); 
(13.9¢) 02 = C169) + HO) — G2 + 90) — Hot FI + ¥) 
+ Bo(x + y,x), 


(13.94) Fu@, y) = —-Bily.x) -Cii,y)+fi@)+ fo) - footy), 
(13.9e) -F5(x, y) = —Ci(y, x) — Ai(x, y) + fax) + fa(Qy) — fo +), 
(13.9f) Fe(x, y) = —Aily,x) — BiG, y) + foxx) + fay) — fo +y), 


for arbitrary maps A,, B,C, : G* — X additive in the first variable, arbitrary 
map By : G* -> X additive in the second variable, and arbitrary maps f; : G > 
X G@=1,...,11). 


13.1.1 Differences that Depend on the Product 
In lattice theory, one considers functions f that satisfy the equation 
flav b)+ farb) = f@+t fe) 
for all pairs a, b. If one applies the substitutions 
aVb=a+b—ab and aAb=ab 


for some transforming lattice concepts to Boolean rings, one has the functional equa- 
tion 


(13.10) fa +y—xy)+ fay) = f@)+ fO), 


known as the Hosszt functional equation, which is connected to the additive function 
(Kuczma [558]) treated by many others [125, 195, 739, 782, 573, 240]. 
Hosszt determined the differentiable solution as 


(13.11) f(x) =ax +b, 


where f : R > R. In Daréczy [195, 197], the integrable solution of (13.10) on Jp is 
obtained as (13.1). In Swiatak [782], the following results are proved. 
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Suppose f : R > R satisfies (13.10). Then: 


(i) If d(x) = f(x) — f (0) is odd, (13.10) can be reduced to Jensen’s equation (J); 
that is, (13.10) <=> (J). 

(ii) If f is continuous at 0 or | or at points a and a + + — 1, then f has the form 
(13.1). 

(iii) If f is integrable on ]0, e[, then f is given by (13.1). 

(iv) In [125], it is stated that any solution f : R —> R of (13.10) is (J); that is, 
f(x) = A(x) + b, where A is additive (no assumptions made). The same 
result holds for x, y € C also. 

(v) In [197], the same result is obtained by connecting to (13.2) as follows. 
Suppose f : R > Risa solution of (13.10). Define 


d(x, y)= f(x)+ fo)—f@y), forx,y eR, 


and obtain 


(13.2a) p(x, y) + b(xy,z) = 60,2) + OG, yz), 


which is cocyclic and, since d(x, y) = f(x + y — xy), with z = a becomes 


f(sy4—-x)4foty-my=sO4s(y4+2-1); 


that is, 
Jolie hsuludh secabe 


by the substitutions xy ++ ~—-x=u+1,x+y—xy=v+1. Thus A(x) = 
f(x+1)—f (1) satisfies (A) forall (x, y)€ S={(x,y):x+y>0orx =0= 
y orx+y-+4 < 0}. To prove that A is additive on R, letu,v € Rand (u,v) ¢ 
S. Then there exists an x such that (x, uv), (xn+u,0v), (x,u+v) arein S owing 
to the fact that, for fixed (u,v) ¢ S, the inequalities x +u > 0, x +u+v>0 
can be made to hold for x. But then A(x +u) = A(x) + A(u), A(x +u+o) = 
A(x +u)+A(v), and A(x +u+v) = A(x) +A(u + v). It follows from these 
equations that A(u+v) = A(u)+ A(v) for (u, v) ¢ S also. Thus A is additive 
on R and the result follows. 


Let G be an Abelian group and F be a field. Let B(R, G) be all functions f : 
R — G satisfying (13.10). Then B(R, G) is an Abelian group. 

Let F be a field with more than five elements. Then f € B(F, G) if and only if 
f is affine (f(x + y) + f(O) = f(x) + f(y)); that is, f satisfies (J) (see Davison 
[208, 209]). The structures of B(R, R), B(C, C), B(F, G), and B(R, G) are known. 

Let d : Z > Z (i = 1,2, 3, 4) be defined by dj (n) = 1, ¢o(n) =n, $3(n) = 
[-3], and ¢4(n) = [—§]. and left ds : Z — Zz be defined by 


1, n=2mod4, 
ps(n) = 
0, n#2mod4. 
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Result 13.8. [208, 209]. f € B(Z, G) ifand only if there exist c, to c5 with 2c5 = 0 


such that 
5 


f(x) = > cidi (x). 
i=1 

Let F be a field and G be an Abelian group. In [227] was characterized all real- 
valued functions f such that f(xy) — f(x) f(y) depends on the sum x + y for all 
x,y € R. Our aim now is to characterize all functions f : F — G for which the 
Cauchy difference (CD) depends only on the product xy for all x, y € F. Further, 
we apply this characterization to determine all solutions of a generalized Hosszt 
equation 


(13.10a) f@tytaxy)+ gy) =ha)+ky), x, yEF, 


where a € F is a parameter chosen a priori. Notice that when a = —1 and f = 
g = h = k, the functional equation (13.10a) reduces to (13.10). If we assume 
a=0, f =—h = -k, and g = a, then (13.10a) reduces to 


(13.12) fa +y)—f@)-—fQ)=ary) forx, ye F. 


Ifa=1,h=k = —g = f, (13.10a) becomes the Pompeiu functional equation 
(Pompeiu [671, 672]) that characterizes field endomorphisms (see the next section). 

First we consider (13.12). Observe that the right-hand side of (13.12) is a Cauchy 
difference, so it satisfies the cocycle equation. That is, defining ¢ : F x F > G by 


(13.1) x,y) :=faty)—-f@—-fO), xyEF, 


we have by this definition the cocycle equation (13.2). By (13.1), (13.2), and (13.12), 
a must satisfy 


a(xy) +a(@ + y)z) = a(x(y +2)) + alyz), x,y, 2€ F. 
Setting y = —x, we obtain 
a(—x”) +a(0) = a(—x? t+xz)+a(—xz), x,yeF. 
If x 4 0, then from the equation above with z = —x~'t, we get 
a(—x*) + a(0) = a(—x? -—t)+a(t), xe F*, te F. 


Next, defining A : F + G by 


(13.13) A(x) :=a(x)-—a(0O), xe F, 
we have 
(13.14) A(=x" —t) = A(x”) — AQ), x € F*, te F. 


Next, we solve the functional equation (13.14). Afterwards, we can easily obtain 
the general solution of (13.12). 
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Result 13.9. (Ebanks, Kannappan, and Sahoo [240]). If F is neither Z2 nor Z3 nor 
an infinite field of characteristic 2, then every solution of (13.14) is additive. 


The sharpness of this result is illustrated in the following examples. 


Example 1. Let F = Z2, G = R, and A: Z2 — R be defined by A(O) = O and 
A(1) arbitrary but nonzero. Then A satisfies (13.14) but is not additive. In fact, it 
can be easily verified that the general solution of (13.14) on Z2 is any map A that 
satisfies A(O) = 0. 


Example 2. The general solution of (13.14) on Z3 is A(O) = 0, A(1) = 4, and 
A(2) = 24 for arbitrary 2 € G. If 34 ~ Oin G, then A is not additive. 


Example 3. Let F = Z2(x), the field of rational functions in x over Z2. Then F 2isa 
subfield of F since s?-+t* = (s+t)* and s*t? = (st)?. If we define A : Z2(x) > Z2 


by 
0, ifs € F?, 
A(s) = 4 
1, ifs € F\F’, 


then A satisfies (13.14) but is not additive. Indeed, in this situation, equation (13.14) 
reduces to 

Aw +t)=A(t), ue F*, teF. 
If t € F*, thenu* +t € F? and A(u* +t) =0= A(t); ift € F\F’, thenu? +t € 
F\F*, so A(u? + t) = 1 = A(t). This shows that A satisfies (13.14). But A is not 
additive because x, x3,x +x? € F\F’, and A(x) = A(x?) =041= A(x 4x3). 
Theorem 13.10. [240]. Suppose F is a field and G is a uniquely 2-divisible Abelian 


group. If F 4 Z2 or Z3, then the maps a, f : K — G satisfy (13.12) if and only if 
they have the form 


(13.15) a(x) = 2A1(x) +c, xeéeF, 
(13.15a) f(x) = A(x?) + An(x) —c, x € F, 
for arbitrary additive maps A,, Az : F — G and arbitrary constant c € G. 


Proof. By (13.14) and Result 13.9, the map A : F — G defined by (13.13) satisfies 
(13.14) and is additive. Call this map 2A 1. Then (13.11) gives us 


a(x) =2A\(x)+c, x EF, 
where c = (0). This is (13.15); substituting this into (13.12), we obtain 
f(x +y)— f@)-fO) =2Al@y)+c, x, ye F. 
On the other hand, the equation above can be rewritten as 


faty)— f@)— f(y) =2A1@y) +e 
= Aj(2xy) +c 
=Ai(at+y)/—x?-y)+e 
= {Ai((x + y)*) —c} — {Ar (x?) — c} — {A (y?) — c}. 
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From this we see that the map Az : F —> G defined by 
Ao(x) = f(x)- AiG’) +e, x EF, 


is additive. This provides (13.15a). The converse is easily verified and completes the 
proof of the theorem. 


Remarks. Theorem 13.10 is sharp; it is false for F = Z2 or Z3. The following 
statements are easily verified. 
The general solution a, f : Z2 — R of (13.12) is given by 


1 
a(x) =dx+c, f(t) = —ydx —e, x € Zp, 


for arbitrary real constants c, d. This will not be of the form (13.15), (13.15a) unless 
d=0. 
The general solution a, f : Z3 > R of (13.12) is given by 


a(x)=dx+c, x €Zs3, 


=e ifx = 0, 


fO)= em if x € Z3\{0}, 


for arbitrary c,d € R. Again, this will not be of the form (13.15), (13.15a) unless 
d=0. 

The next result is a rephrasing of Theorem 13.10 and characterizes functions 
whose Cauchy difference depends on the product of arguments. 


Corollary 13.10a. Suppose F is a field and G is a uniquely 2-divisible Abelian 
group. Let f : F — G bea function. If F 4 Z2 or Z3, then the Cauchy difference 
f(x+y)— f(@) — f(Q) depends only on the product xy for allx, y € F if and only 
if f is of the form 

(13.15a) f(x) = Ai’) + Aaa) +e, 


where A,, Az : F > Gare additive maps and c is an arbitrary but fixed element in 
G (see also Ecsedi [247)). 


This result is extended in the following corollary. 


Corollary 13.10b. [240]. Maps f, g,h,k : F — G satisfy the functional equation 
(the Pexider difference depends on the product xy) 


(13.12a) f@ +y)— g@) —h(y) = ky) 


for all x,y € F, where F is a field different from Z2 or Z3, and G is a uniquely 
2-divisible Abelian group, if and only if f, g, h, and k are given by 


f(&) = Ai?) + Ag(x) +4, 

g(x) = Aj (x?) + Ao(x) — bi, 
h(x) = Aj (x7) + Ao(x) — bo, 
K(x) = 2Ai(x) +b1 + bo +4, 


(13.15b) 
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where A,, Az: F — G are additive functions and a, b, bz € G are constants (see 
also Kannappan and Sahoo [509)). 


Proof. Letting x = 0 and y = 0 in (13.12a) separately, we obtain 


(13.16) = A(y) = Ff) -—g@)—kO) and g(x) = f(x) —hO)— kO), 


respectively. Note that g and h differ from f by a constant. Inserting (13.16) into 
(13.12a), we get (13.12), where 


a(x) := k(x) — g(0) — h(O) — 2k(0). 


By Theorem 13.10 and (13.16), we obtain the asserted solution with by := h(O) + 
k(O) — a and bz := g(0) + k(0) —a. 


Theorem 13.11. [240]. Suppose that K is a ring with unity and G is a multi- 
plicative group. Let f : K — G be a function for which the Cauchy difference 
f(xy) f(v)7! f x)! depends only on the sum x + y for all x,y € K. Then f is 
constant. 

The same conclusion holds if we suppose that f (xy) f (x)~! f (vy)! depends only 
onx+y. 


Proof. First, suppose that 
(13.17) FEY FOY'FG)' =A ty), xy EK, 
for some map h : K — G. Putting y = 0 here, we get 

fay t=h@), xeK, 
and with this (13.17) becomes 

FEYFOY FO) = fet, xy ek. 
The last equation can be written as 
fatwfay=f@FO), x yEK. 

Putting x = 1, we obtain f(1 + y) f(y) = fd) fG); that is, f+ y) = fC) for 


all y € K. Hence f is constant. 
The proof is completely analogous if 


fry) fx) FO)! =hA@+y) 


holds instead of (13.17). 
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Solution of (13.10a) 


Theorem 13.12. [240]. Suppose that F is a field with at least five elements and that 
Sf, g,h,k : F — G, where G is an Abelian group, satisfy the functional equation 
(13.10a) for some fixed a = F*. Then, and only then, we have 


f (x) = —A(ax) +4, 

g(x) = A(a*x) +a +b; + by, 
h(x) = —A(ax)+a+b,, 
k(x) = —A(ax)+a+bo, 


(13.18) 


where A: F — G is an additive function and a, bj, bz € G are arbitrary constants 
(see also [509}). 


Proof. Letting y = 0 and x = 0 in (13.10a) separately, we see that 
(13.19) A(x) = f(x) +g@)—kO) and ky) = fy) +8) —hO), 


respectively. Note that h and k differ from f by a constant. Using (13.19) in (13.10a) 
yields 


(13.20) f(x +y +axy)+ gy) = f@)+ fy) + b1 + bo, 
where 

bi} = g(0)—k(O), by =g()—hO). 
Substitution of y = —t (recall a 4 0) in (13.20) gives 
(13.21) g(x) = f(—ax) +b, + bo. 
Hence, by (13.21), (13.20) becomes 
(13.22) fat+yt+axy)+ f(-axy) = f(x) + fQ) 


for all x, y € F. Since a ¥ 0, replacing x by —+ and y by -2 into (13.22) and 
defining 


(13.23) w(x) := f (=): 
we obtain 
(13.10) wx +y—xy)+ wry) = wa) + vw) 


for all x, y € F. Using Result (111) from above, we obtain w(x) = A(x) + u, where 
A is additive. From this and (13.19), (13.21), and (13.23), we obtain the form of 
f,g,h, and k as asserted in (13.18). This completes the proof of the theorem. 
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13.1.2 Pompeiu Functional Equation and Its Generalizations 


Let G = R — {1}. Then G(o) is an Abelian group, where the group operation o is 
defined as 
xoy=x+y4+xy, x,yeEG. 


The difference f (xo y)— f(x) — f(y) is a function of x, y, say a(x, y). Ifa(x, y) = 
f (x) f(y), we obtain the functional equation 


(13.24) faty+xy)=f@)+fO+f@FO) forx,y ER, 


known as the Pompeiu functional equation [671, 672, 515, 646], which characterizes 
field endomorphisms in fields with more than four elements. 

To see this, add 1 to both sides of (13.24) and write F(x) = 1+ f(x). Then 
(13.24) reduces to F(x + y+ xy) = F(x)F(y). Now, replacing x by x — 1 and y 
by y — 1, we obtain M(xy) = M(x)M(y), where M(x) = F(x — 1). Thus, M is 
multiplicative and f(x) = F(x) — 1 = M(x +1) — 1, which solves (13.24). 

In a special case, f is an automorphism of the field R. Suppose M is also additive. 
Then M is a ring homomorphism of R. If M is a nontrivial homomorphism, then 
f(x) = M(x) = x; thatis, f is an automorphism of the field R. 

Note that the solution of (13.24) is obtained without assuming any regularity 
condition on f. 

We determine the general solution f, p,qg,g,h : R* — R of the functional 
equations 


(13.24a) f@+y+xy)= pa)+qQ)+e@)rQy), forallx,y € R’, 


and 


(13.24b) f (ax + by + cxy) = fx) + FO) + FR)FO), 


which are generalizations of the Pompeiu functional equation (13.24). We present a 
method that is simple and direct to determine the general solutions of (13.24a) and 
(13.24b) without any regularity assumptions. 

The following two lemmas are instrumental for establishing the solutions of 
(13.24a) and (13.24b). 


Lemma 13.13. Let g,h : R* > R satisfy the functional equation 


(13.25) g(xy) = g(y) + g@)hy) 
forall x, y € R*. Then they are given by 


g(x) =0, h(y) = arbitrary, 
(13.25a) g(x) = L(x), h(y) = 1, 
g(x) =a[M(x)—- 1], AQ) = My), 


where M : IR* — R is a multiplicative map not identically one, L : R* > Risa 
logarithmic function not identically zero, and a is an arbitrary nonzero constant. 
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Proof. If g = 0, then A is arbitrary and we obtain the trivial solution. We assume 
hereafter that g A 0. Using (13.25) a few times, we have 


g(xyz) = g(xy-z) 

= g(z) + g(xy)h(z) 

= g(z) + [e(y) + g@)AQ)IAG@) 

= g(x - yz) 

= g(y)h(z) + 8(z) + g(x)h(yz); 
that is, since g £0, 

h(yz) =hQ)h(), yz €R*, 

or 

h(x) = M(x), x €R*, 


where M is multiplicative. 

Ifh = M = 1, then (13.25) shows that g = L is logarithmic. Suppose h = M # 
1. Then interchanging x with y in (13.25) and comparing the resulting equation to 
(13.25), we get 


g(y[A(x) — 1] = g@)[A(y) — 1. 
Since h(x) 4 1, the equation above gives 
g(x) = a[h(x) — 1] = a(M(Qx) — 1), 
where a@ is a nonzero constant, since g 4 0. This proves the lemma. 


Lemma 13.13a. (Kannappan and Sahoo [514]). The general solutions f,g,h : 
R* > R of the functional equation 


(13.26) f(xy) = f(*)+ f(y) tag) + PhO) + 8@)h(y) forallx,y €R*, 


where a and f are a priori chosen constants, are given by 


(13.26a) f(@)=L(x)+af, g(x) =arbitrary, h(y)=-—a; 
(13.26b) f@)=Lx)+af, g(x)=—-f, hy) = arbitrary; 
f (x) = Lo(x) + cL} (x) + af, 

(13.26c) g(x) =cLi(x) — B, 


h(y) = Lily) —4; 
f (x) = Lx) + y O[M (x) — 1] + af 
(13.26d) g(x) = y[M(x) -1]-f 
h(y) = o[M(y) —l]—a, 
where M : R* — Risa multiplicative map not identically one, Lo, L1, L : R* > R 


are logarithmic functions with L, not identically zero, and c, 6, y are arbitrary 
nonzero constants. 
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13.1.3 Solution of the Functional Equation (13.24a) 


Theorem 13.14. The functions f, p,q, g,h : R* — R satisfy the functional equa- 
tion 


(13.24a) f@+y+xy) = p&)+40)+ 8@)hQ) 
for allx, y € R* ifand only if 
f@m=Law+1l+af+a+b, 
p(x) =La+1)+), 
(13.27a) q(y)=L(y+ 1) +af8 +a+ Phy), 
g(x) = —f, 
h(y) = arbitrary; 
f@)=La+1)+ah+a+b, 
pxy=Lat+1)+afP+b+ag(x), 


(13.27b) qiy) =L(iy +1) +4, 
g(x) = arbitrary, 
h(y) = —a; 


f(x) =Lawt+1)+yd[M(x+1)—-1+afP+a+b, 
pix)=La+1)4+ (6+ a)y[(M(x 4+ 1)- 1) +8, 
(13.27c) qy)=LY+ D+ + AIM(y +1) — I +a, 
gx) =yIM+1)-1-8, 
A(y) = o[M(y + 1) —- 1]—-a; 
fx) =Lox+ D+ 5cLiet+l)+afh+at+b, 
p(x) =Lo(x +1) + scLi(x +1)+acL(~«4+1)+5), 
(13.27d) q(y) = Loy +1) + 5cLiy+ D+ 6Li0 +) +4, 
g(x) =cL(x4+1)-f, 
A(y)=Lifyt+1)—a, 
where M : R* — R is a multiplicative map not identically zero, Lo, L1, L : R* > 


R are logarithmic maps with L not identically zero, anda, B, y, 6, a, b, care 
arbitrary real constants. 


Proof. First, we substitute y = 0 and then we put x = 0 in (13.24a) to obtain 


(13.28) p(x) = f(«) —at+ag(x) 

and 

(13.29) q(y) = f(y) — b+ Bry), 

where a := q(0), b := p(O), a := —h(O), B := —g(0). Letting (13.28) and 


(13.29) into (13.24a), we have 
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fatytxy) = fx)+fQ)-a—b+ag(x) + Bh(y) + g(x)aAQy) 
for x, y € R*, which by replacing x by u — 1 and y by v — | and then defining 
Flu) := fu-1)-a-b, Gtu):=g(u-1), H(t) :=htu-1) 
for all wu € R* yields 
(13.26) F(uv) = F(u) + F(v) +aG(u) + BA (ve) + Gw)A(v) 


for all u,v € R*. The general solution can now be obtained from Lemma 13.13a. 
The first two solutions of Lemma 13.13a (see (13.26a) and (13.26b)) together with 
(13.28) and (13.29) yield the solutions (13.27a) and (13.27b). The next two solutions 
(that is, solutions (13.26c) and (13.26d)) yield together with (13.28) and (13.29) the 
asserted solutions (13.27c) and (13.27d). This completes the proof of the theorem. 


13.1.4 Solution of the Functional Equation (13.24b) 
The only constant solutions of (13.24b) are f = 0 and f = —1, so we look for 
nonconstant solutions of the functional equation. 


Result 13.14a. (Kannappan and Sahoo [515]). The function f : R + Risa solution 
of (13.24b) if and only if f is given by 


M(cx) -1 ifa=0=b,c 40, 

E(x)-1 ifa=1=b,c=0, 
f(s) = : 

M(cx+1)-1 ifa=1=b,c#0, 

k otherwise, 


where M : R — Ris multiplicative, E : R — R is exponential, and k is a constant 


satisfying k(k + 1) = 0. 


13.2 Quadratic Differences 


Let G and H be Abelian groups and H be divisible. Fora map f : G — H, the 
difference 


(13.30) Fat+y)+ f@—y)-—2f@)—-2f), forx,y eG, 


is called the quadratic difference. 
If we denote the quadratic difference (13.30) by F, 


(13.30a) Fix,y)=faty)+ f@ —y)—2f(@) —2F(), 


where F : G x G — H, then F satisfies the functional equation [765], 
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(13,31) F@+y.2)+F &—y,2)—2F G2) = F422) +6 &, y—z)—2F (x, 9). 


It is known that this equation is not sufficient to guarantee (13.30a), even when 
G = H = R (the additive real group). On the other hand, some sets of sufficient 
conditions for (13.30a) are known. For instance, (13.31) and boundedness of (13.31) 
and two times continuous differentiability are sufficient when F : R? > R. 

Some sets of conditions that are both necessary and sufficient for F : G — H to 
have the representation (13.30a) are given in [244]. Let G be an Abelian group and 
H be a uniquely divisible Abelian group. 

The following result on Cauchy differences is used to prove Result 13.16. 


Result 13.15. (Ebanks and Ng [244]). A map K : G> => H is symmetric and 
satisfies the condition that 


(13.32) (x,y) K(x, y,w) is a(CD) Cauchy difference for each w € G 


if and only if K itself is a Cauchy difference of order 2; i.e., there exists a map 
g:G — A such that 


(13.33) K(x, y,z) = g(+y+z)—g(x+y)—g(«+z)—-8(yt+z)+8@)+8(y)+8(z) 


forallx,y,z€G. 


For a given F : G x G — H, let F, and F; be the canonical odd and even parts 
of F. That is, define F,, Fe: G x G — H by 


Fox, y) = [F(x,y) — F(—x,—y)], 


Lenire 


F(x, y) = [F(x,y) + F(—x,—-y)]. 


2 


Furthermore, we decompose Fy into its symmetric and skew-symmetric parts Fe4 
and F._, 


Foy (x, y= [Fe(x, y)+ Fe(y, x)], 


[Fe(x, y) = Fe(y,x)]. 


NIlLR Nelle 


Fe- (x, y) = 
Result 13.16. [244]. In order for F : G x G — H to have quadratic decomposition 
(13.30a) in terms of an arbitrary f : G — H, it is necessary and sufficient that 


(a) F satisfy (13.31), 
(b) the map Ke+ defined by means of Fe4. satisfy (13.32), and 
(c) F satisfy 


(13.34) F(w+y,x—y)+2F (x,y) = F(x,x)+ FO, y)+F(0,0) forx,y €G. 
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13.2.1 Differences in a Prescribed Set 


First we consider the Cauchy difference and then the Pexider difference. 
Let X be a real linear topological space (always assumed to be Hausdorff) and 
f : X — Rsuch that (CD) € Z; that is, 


(13.35) faaty)-f@)-fo)EZ forx,yeX. 
The question is under what condition f has the form 
(13.36) f=At+k, 


where A : X — R is a continuous linear functional and k : X — R takes integer 
values only. 

An example in Godini [321] shows that (13.35) is not generally true, where f : 
R — R must be of the form (13.36), where A is additive, and k : R —- Z. However, 
such a representation is possible under some regularity condition imposed on f. 


Result 13.17. (Baron [98]). If the Cauchy difference (CD) (where f :R — R) asa 
function of two variables is Lebesgue measurable and takes integer values only, then 
there exists an additive A : R — R and a Lebesgue measurable k : R > R such 
that f has the form (13.36). 


Result 13.18. (Baron and Volkmann [102]). Jf a Lebesgue measurable function 
f :R —- R satisfies (13.35), then there exists a real constant c and a Lebesgue 
measurable k : R — Z such that f (x) = cx +k(x) forx €R. 


Result 13.19. [102]. Let X be a real linear topological space. If f : X — R satisfies 
(13.35) for all x,y € X and there exists a set P © X such that 0 € int(P — P) 
and f(x) € Z+ (-t. t for every x € X, then there exists a continuous linear 
functional A: X — Randak: X => Z such that f has the form (13.36). 

Result 13.20. [99, Theorem 4]. Let X and Y be topological vector spaces and P be 


a countable, discrete subgroup of Y. 
Suppose f : X — Y satisfies 


(13.35a) faty)-—f@)-—fO)EP forx,yEE. 


Then, whether X is a Baire space and f is Baire measurable or X is a separable 
F-space and f is Christensen measurable, there always exists an additive function 
A: X — Y such that 


(13.36a) f(x)-A@)eEP, forx eX, 


of a local basis with the additional assumption of a local function A : X — Y such 


that (13.36a) holds. 


Result 13.21. (Baron and Kannappan [100]). Assume that X is a real topological 
vector space and M is a o-algebra of its subsets such that there exists a o-ideal T 
of subsets of X fulfilling the following three conditions: 
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1X €T. 
2. IfP € MNT, thennP € T for every positive integer n. 
3. If P € M\T, then 0 € int(P — P). 


Let Y bea locally convex real topological vector space and P be a countable discrete 
subgroup of Y. If afunction f : X — Y fulfills (13.35a) and f—~'(W) € M for every 
open subset W of Y, then there exists a continuous linear operator A : X —> Y such 


that (13.36a) holds. 


Motivated by the results above, the following results are proved in [137]. Let X 
be a real linear topological space, G(+) a topological group, P a normal discrete 
subgroup of G, and M a family of subsets of X. 


Definition. A topological group G(+) is o-bounded if and only if for each neigh- 
bourhood V of 0 there exists a sequence {x,} in G such thatG = U{V+x,:n € 
Z*}. 

Result 13.22. (Brzdek [137]). Let f : X — G be continuous at 0 and satisfy 


(13.35a). Then there exists a continuous additive A : X — G such that (13.36a) 
holds. 


Result 13.23. [137]. Suppose that G is a o-bounded topological group and there is 
a o-ideal, T, of subsets of X such that 0 € int(C — C) for eachC € M\T. Ifa 
function f : X — G fulfills (13.35a) and there is a set H C X, H ¢ T, such that 
the function f |4 is M-measurable, then there exists a continuous additive function 
A: X — G such that (13.36a) holds. 


Result 13.24. Assume X is a topological vector space, G is a group, P is a nor- 
mal subgroup of G, and f : X — G satisfies (13.35a). If U C X is a balanced 
neighbourhood of origin and C is a symmetric subset of F (i.e., —C = C) such that 


PN(C+C+C)={0} and f(U)CP+C, 


then there exists an additive function A : E — F such that AU) C C and f (x) — 
A(x) € P for every x € X. 


Let X be a real topological vector space, Y be a real Banach space, P be a 
subgroup of Y, and C be a symmetric (i.e.. —C = C) subset of Y. Consider for 
f :X — Y the congruence (difference) 


(13.35b) faty)—-f@)-fO)EP+C forallx,yeX. 


Theorem 13.25. (Baron, Simon, and Volkmann [101]). Assume X is a real topolog- 
ical vector space, Y is areal Banach space, P is a discrete subgroup of Y, and C is a 
symmetric, convex, and compact subset of Y such that PCA (6C) = {0}. If f : X > Y 
satisfies (13.35b) and is continuous at at least one point, then there is a continuous 
linear function A: E — F such that 


f(x~)-A@)eEP+C forxeX. 
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13.3 Pexider Difference 


Now we consider the Pexider difference 


(13.37) f(x +y)— g(x) -hAGy), 


where f,g,h : G — H, Gis a groupoid with identity 0, and H is a group, and 
obtain representation of f, g, 4 when (13.37) takes values in a fixed subgroup of H. 

We start with the following useful lemma that is in tune with the extensions 
considered in Chapter 1. 


Lemma 13.26. [99]. Suppose X is a topological vector space (real or complex), U is 
a balanced neighbourhood of the origin, and H is an Abelian group. If Ay : U > H 
is additive (i.e., 

Ao(x + y) = Ao(x) + Ao) 


for allx,y € U withx + y € U), then there exists a unique additive function 
A: X — FH such that A is an extension of Ao. 


Proof. Using induction, it is easy to prove that 
(13.38) Ao(nx) = nAg(x) 
for every x € U with nx ¢€ U. (Recall that U being balanced implies aU C U for 


|a| < 1.) Forx, yin U and integers m, n with mx = ny, we have, using (13.38), 


(13.39) mAo(x) = mnAg (=) = mnAgo (=) = nAg(y). 


m 


Now we introduce a function A : X — H by 
A(x) =nAo (-) », x EX, 
n 


where n is any integer such that x/n € U. This function A is well defined because 
of (13.39). It is easy to verify that A indeed is additive on X, is equal to Ag on U, 
and is unique. This completes the proof of the lemma. 


Now we prove the following theorem regarding the Pexider difference (13.37). 


Theorem 13.27. [99]. Suppose G is a groupoid with identity 0 and H is a group. 
Let P be a subgroup of H.A triple (f, g,h) of functions mapping G into H fulfills 
the condition 


(13.37a) fet y=— ee) —20) €P 


forall x,y € G if, and only if, there are functions a, B: G—> K, 6: G—> H and 
constants a,b € H such that 


(13.40) o(x+y)—o@)-¢0) € P, 
forallx,y € G, and 


f(x) =aQ)+o()+a, gx)=b-at(x)+a, h(x) =B(x)+4¢Q@)+a—b, 
for everyx €G. 
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Proof. Assume that (13.37a) holds for all x, y € G. Then, setting in (13.37a) first 
y = 0 and then x = 0, we have 


(13.41) u(x) = fx) -— g(x) -AO)E P, forx EG, 
and 
(13.41a) v(iy) = f(y) —g(0)-—h(y) € P,  foreveryy €G. 


From the two relations above, we get 
(13.42) 
g(x) = —h(0) -— w@®)+ f@), A@)=—-v@)+ f@)-—80), forx eG. 


Putting (13.42) into (13.37a) and using (13.41a), we obtain 
(13.43) fa@ty)— f)+u)+hO0)+ g0)—fQ)eP forallx,y eG. 


Let x = 0 = y in (13.37a) to get f (0) — g(0) —h(O) € P. Definea, 6,6: G—> H 
by 

(13.44) a(x) = ux) +hO)+ g0)— fO), Ba) =—v@%) +k), 
(13.45) p(x) = —k(x) + fx) — FO). 


Indeed, a(x) and f(x) belong to P, and 


d(x +y)—- &@) - 0) 
=-a(x+y)+ fxty)— f(x) +a(x)+ fO)- fO) +a) 
=—a(x+y)+[f@t+y)— f@)+ vu) +h(0) + g0)-— fO)]+ 40) 


also belongs to P for all x, y € G because of (13.43). Thus (13.40) holds for all 
x,yeEeG. 

Finally, putting a = f(0), b = g(0), from (13.45), (13.42), and (13.44), we 
obtain the desired forms of f, g, and h. The converse is easy to check. 


Result 13.28. [99]. Let X be a topological vector space, F be a topological Abelian 
group, and P be a discrete subgroup of F. Suppose ¢ : E — F satisfies (13.40) for 
all x, y € X and is continuous at the origin. Then there exists a continuous additive 
function A : X — F such that 


(13.46) d(x) -A(x) Ee P foreveryx € X; 
Lé., 

d(x) = A(x) + A(x) for everyx € X, 
where 2: X — K is continuous at the origin. 


Proof. Since P is discrete (that is, every element of P has a neighbourhood contained 
in F containing only that element), there exists a neighbourhood V C F of zero such 
that P 1 V = {0}. Further, there exists a neighbourhood W C F of zero such that 


532 13 Difference Equations 
—-W=W, W+W+WcV 
(Rudin [715]), and a balanced neighbourhood U C E of zero such that 
pU) C dO) +W 
[715]. Then, for x, y ¢e U withx + y € U, 
[p+ y) — 60)1 — Lo) — )] - 140) -@O] eW-W-We’. 

On the other hand, from (13.40), (0) € P and 

[p(x + y) — 6(0)] — [&) — 6)] — [2) — @O)] € P. 


Consequently, 


P(x + y)— 60) = [) — 60)1 + [60) — 60)] 


for allx, y e U withx+y € U. Therefore, by Lemma 13.26, there exists an additive 
function A : X — F such that 


(13.47) f(x) — ¢(0) = A(x) forevery x € U. 


In particular, A is continuous everywhere. Further, if x ¢ FE, then * € U for some 
positive integer n, and using (13.47) and (13.40), we get 


we) ats) 0 (2) ~n0 2) +00 2) -n0 (3) 
=[¢ (n-) —ng (-)| +no(0) € P. 


This proves the result. 


In the next result, we assume a measurability condition on ¢ instead of the con- 
tinuity at the origin. 
Result 13.29. [99]. Let X and F be topological vector spaces and P._ be a countable 
discrete subgroup of F. Suppose 6 : X — F satisfies (13.38) for all x,y € X. 
Then, whether X is a Baire space and ¢ is Baire measurable or X is a separable 


F-space and ¢ is Christensen measurable, there always exists an additive function 
A: X — F such that (13.46) holds. 


13.3.1 Some Generalizations 


We now give the general solutions of the functional equation 


(13.48) 
f(q,u) — g(pq, uv) —h(q — pq, uw) = A(p, q, 0d, w) 
(p,q € 1, u,v, w € 0, 1[> with ov + w € JO, I[*) 


in which f,g,4 : Ix ]0,1[°— R are Lebesgue measurable functions in their first 
arguments, A(p,-,v, w) is an affine map of J into R, and s is a nonzero cardinal 
number. This functional equation is motivated from a functional equation that arises 
from the study of measures of information. 
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13.3.2 Measurable Solutions of the Functional Equation (13.48a) 


(13.48a) F\(q,u) — Fo(pq, uv) — F3(q — pq, uw) = A(p,q,0v, w). 


We shall extend the following result. The measurable solutions of the functional 
equation 


F\(q) — Fo(pq) — F3(q — pq) — A(p.q); 
q — A(p,q) affine, for p,g € 1, 


are given by 


Fi(p) =aplogp+bipt+c, i=1,2,3, 
A(p, q) = al[—plog p — (1 — p) log(1 — p)lq + (63 — b2) pq 
+(b) —b3)g+e1—c2-0¢3, forp,g € I, 
where a, bj, cj are constants. 


Result 13.30. (Kannappan and Sahoo [514]). Let F; : Ix J0,1[°—> R be func- 
tions that are Lebesgue measurable in their first arguments (i = 1,2,3). Then the 
solutions of the functional equation (13.48a) with q > A(p,q,v, w) affine, for 
p,q € 1, u,v, w € JO, 1[% with v + w € J0, 1[°, are given by 

Fi (p,u) = aplog p + [L(u) + bilp + (L2 + L3)(u) +. c1, 

F(p,u) = ap log p + [L(u) + balp + Lau) + c2, 

F3(p,u) = aplog p + [L(u) + b3]p + L3(u) + ¢3, 
A(p,q, 0, w) = a[—p log p — (1 — p) log(1 — p)lq + [b1 — bop — 631 — p) 

— pL(y) — 0. — p)L(w)]q — £20) — L3(w) + c1 — 2 — 63, 
(p,q € 1, u,v, w € 0, I> withv + w € JO, 1[°), 


where L, L2, L3 are logarithmic (i.e., satisfy (L)) and a, bi, cj are constants. 
Let f : Rx R > R bea function on the plane R*. The second difference of f is 
Ff Q@ity1,%2+y2)— f 1 +1, x2 — y2)— f 1 —y1, X2 + y2) + f 1 — 1, x2— ya), 


where y; and yp represent the increments. The second difference of f is said to 
admit product form if there exist functions g, h : R x R — R such that 


(13.49) f (x1 + y1,*2 + yo) — fer + y1,%2 — y2) — f%1 — y1,%2 + y2) 
, + f(%1 — y1,x2 — y2) = g(%1, x2)h(1, y2) 


for all xj, yj € R @ = 1, 2). A special case of (13.49) is the equation 


f (xi + y1,x%2 + y2) — fi + yi, x2 — y2) 


(13.50) 
— f(x — y1, x2 + y2) + f@1 — y1, x2 — y2) = 0, 
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for all x1, y1,.x2, y2 € R. The functional equation above reduces to 


fQr+t.x2 +t) — fxr +t, x2) 


(13.51) 
— f(x1,x2 +t) + f@1, x2) = 0, for all x1, x2,t € R, 


if one first substitutes yj = y2 = 4 and replaces x; by x; + 5 and x2 by x2 + 4. The 
functional equation (13.51) was investigated by many authors (see [628]). 

We study the class of functions whose second difference admits product form. 
In particular, we determine the general solution of the functional equation (13.49) 
for all x1, v1, x2, y2 € R. Since our proof extends to structures more general than R, 
we in fact consider the domain of f, g, 4 to be a product of arbitrary groups and the 
range to be a quadratically closed field of characteristic different from 2. Here, for 
f,g,h:G, x G2 —> F, we consider the functional equation 


f(xiy1, x2y2) — f iyi, x25 ') — fy], x22) 


(13.49a) ae 
+ fy,» x2¥7 ) = (x1, x2)A(V1, y2) 


with f satisfying the conditions 


(KC) f(x1yizi,%2) = f(riziyi,x2) and f (x1, x2y2z2) = f 1, x222y2) 


for all x;, yi, zi € Gj @ = 1, 2). Since we do not assume the groups to be Abelian, 
the condition (KC) plays an important role for solving (13.49a). The general solution 
of (13.49a) with (KC) is determined by deriving two related functional equations, 


g(xiy1,x2y2) + g(xiyi, x2¥)') + gery; |, x22) 
+ g(xiyy |, x2y5/) = 8x1, x2) P (1, y2) 


and 


A(xiy1,x2y2) — Ai y1, x29) |) — ACery;, |, x2y2) 
+ A(xiy, |, x2y5) = p(x1, x2)h(1, y2). 


We determine the general solution of (13.49a) in the following result. 


Result 13.31. (Kannappan and Ng [493]). Let G1, G2 be arbitrary groups and let 
F be a quadratically closed field of characteristic different from 2. The complete list 
of functions f, g,h : G1 x G2 —> F satisfying the functional equation (13.49a) with 
(KC) is 


Ff (x1, x2) = B(x1, x2), 
g(x1, x2) =0 and h arbitrary, or 
h(x1, x2) =0 and g arbitrary, 
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fi, x2) = B(x, x2) + afar £1 (01) — a2 £1 (x1) ')E2(x2) 
— [Bi E101) — BoE i (x1) ')E2(x2) 1}, 
(x1, x2) = [a1 £1 (01) + a2 E1001) )E2(x2) + [A E11) 
+ BrE\(x1)7'}E2(x2)!, 
AQ, y2) = @[E1 (1 — E1Q1)7 "WE 1(y2) — E1(y2)7'1, 
with Ei(xi) # Ei@zi)’ (= 1,2); 
f (x1, x2) = B(x1, x2) + Eio(x1){la1A (1) + 41 JA@1) £222) 
+ [a2A (x1) + a2]A@1)E2@2)"}, 
e(x1,42) = Eto(x1){[4a1A (x1) + 2a1] E2(x2) + [402A (x1) + 2a2] E2(x2)~"}, 
h(y1, y2) = E1o(v1) AQ E292) — E2(y2) |], 
with Ex(x2) # Ex(x2)7!: 
f (x1, x2) = B(x1, x2) + {E1(%1)[a1A @2) + a1 JA (x2) 
+ Ey (x1) '[a2A(x2) + a2] A (x2)} Er0(x2), 
(x1, x2) = {E1(«1)[401A (x2) + 2a1] + E1 (x1) |[402A (x2) + 2a2]} £20 (x2), 
A(y1, 2) = E101) — £101) 'JA(2) £2002), 
with Ey (x1) # Ei(x1)7!: 
Ff (1, 2) = B(x1, x2) + Eto(1) E20(x2)[03A1 (01)? A2 (2)? 
+ a Ay (x1)?An(x2) + aA (1) A2(a2)® + 2A @xr)Arle9)), 
8(%1, 2) = Eto(%1) E20 (%2)[1643A 1 (x1) A2(x2) + 841 A1 1) + 8a2A2(x2) +a], 
A(y1, ¥2) = E10(¥1) £20(y2) A101) A202); 
f (1, x2) = B(x, x2) + 5 E101) E202) B(1, x2), 
8 (x1, ¥2) = 4E10(1) E20(x2), 
h(y1, y2) = Eto(y1) E20(y2)BO1, y2)s 


where a, a; (i = 1,2,3), Bj, aj (7 = 1,2) are arbitrary constants, the maps 
A, Aj, Az are additive, the map B is biadditive, E;, Ejo are exponential maps with 


Ejo(xi) = Ei(xi) = Ei(xi)', 
and f is an arbitrary solution of 
B(xiyt, x2y2) — Boyt, x2yz') — BOriyy | x2y2) + Bay, |, x2yz') = 0 


with Pp satisfying (KC). 
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Characterization of Special Functions 


This chapter is devoted to functional equations connected to a few special func- 
tions such as the gamma function, beta function, Riemann’s zeta function, Riemann’s 
function, Cantor-Lebesgue singular functions, Minkowski’s function, and Dirichlet’s 
distribution. 


14.1 Gamma Function 


Euler used the gamma function as an interpolation function for the factorials n! = 
1-.2---n [261]. Legendre [604] introduced the notation T such that [(n) = (n — 
1)! forn € yale As a matter of fact, it was D. Bernoulli [121] who gave the first 
presentation of an interpolating function of the factorials in the form of an infinite 
product, later known as the gamma function. Euler gave representations by integrals 
and formulated interesting theorems on the properties of this function. The gamma 
function appears in physical problems and applications. The gamma function is 
especially useful to develop other functions that have physical applications. The 
function 


(14.1) Tn) =(n-1)!, forne Z*, 


is defined for positive integers n by recurrence (1) = 1, T(#2 +1) =aI(n). 


It is natural to expect to extend this to positive reals as solutions of the equation 
(14.2) ft 1)=xf() forx > 0. 


Some of the well-known representations of the gamma function I’ are (Gronau 
[324]) 
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nin* 
14.3 P@) = lim ———W— 0, oof, 
(14.3) (x) eS era x € ]0, oof 
CO 
(14.4) T(x) = er ai. 0; 
0 
1 
(14.5) re) = (— log y)*~!dy, x>0, 
0 
oa S 
(14.6) T(x) =) e*e © ds, x >0, 
=00 


and I satisfies, in addition to (14.2) to (14.5), 


n—1 


k 
(14.7) n* 37] Ff (- i ) = (2n)2"-) f(x), x >0, ne Z%, 
k=0 


n 


ri 


(14.8) f@a)yfd—-x)= ; x € ]0, 1[, 


sin Xx 
(14.9) T(x) | pie fe2)- > 0 (due to Euler) 
. oo = = ue tO Buller). 
e Xl n n ' - 


The integrals (14.4), (14.5), and (14.6) are equivalent. The substitutions y = 


t 


e', s = logt separately in (14.4) result in (14.5) and (14.6), respectively (y = ee 
reduces (14.5) to (14.6); t = e* transforms (14.6) to (14.4)). The functional equation 
(14.2) is the most well known among them since it is the simplest, most elegant, and 
most remarkable formula. It is also known as the gamma functional equation. 

The logarithmically convex function satisfying (14.2) with f(1) = 1 is called 
Euler’s gamma function. 


Remark. ['(x) is not the only solution of (14.2). For example, 
f@)=T@ye?™ or F@)p(r), x>0, 


where p is an arbitrary periodic (analytic) function of period | is also a solution of 
(14.2) (see [554]). Indeed, 


fH FD =e + DePOt) = xT rye? = xf (x). 
So, some additional condition is necessary on f to characterize [ through (14.2). 
Notation. Forn,v € Z, define (v), = vv+1)---WV@+tn—1), VY) =1. 


Theorem 14.1. (Anastassiadis [76], Bohr and Mollerup [126]). T : JO, co[ — ]0, oo[ 
logarithmically convex is the unique solution of equation (14.2) with T(1) = 1. 


Proof. Let f satisfy (14.2) with f(1) = 1 and be logarithmically convex for x > 0. 
From (14.2) there follows, forn > 2,0 <x <1, 


(14.10) f@t+tn=O)nf), Ffm=@—-D!. 


Since f is logarithmically convex, e™ f (x) is also convex for x > 0 and a aconstant. 


14.1 Gamma Function 539 


For 
n-l<n<n+x<n+l1 (ninZ)>2,0 <x <1), 


one has from convexity 


et") Fn _ 1) _ e"" f (n) y ett) Fn +x) = e" f (n) 


—1 _ % 
2 et +) Fn a 1) _ e@” f (n) 
a 1 
and from (14.10) 
—1)! —1)!—xe~“(n — 2)! an!) — — 1)! — 1)! 
x(n yl+(n 1) xe"(n a ce ee ae= +a )t 
e*(x)n e*(x)n 
Replacing n by (n + 1) in this inequality, one gets 
xn! +n! ane i — 1)! 2etiy2 xe“n! —x(n—1)!4+ (n—-1)! 
eft (X)n41 e(X)n 
Taking a = — logn in the inequality above, we have 
nin* < f(x)< ninX x+n 
(X)n+i (X)nti n 
or 
(14.3) f@)= =T(x) @e(,1)). 
ne a 


We have f(x) = I(x) for x € (0, 1]. For the other values of x, (14.3) results from 
the functional equation (14.2). 


Proof from Kuczma [554]. Theorem 5.11 in [554] shows that there exists a unique 
function fulfilling the hypothesis. Since logx is completely monotonic, the func- 
tion — log I(x) is also completely monotonic (cf. Theorem 5.12 and the corollary in 
[554]) and thus I(x) is analytic in (0, oo). Further (by Formula 5.27 in [554]), we 
obtain 


Ss (n+ In +2)*7! 
mer pmt 


n—2 


T(x) 


n=0 


_ (K+ 1)(k +2)*7! 
> ell (k +x)(k + 1)*- 1 


(n —1)!n*7! 
= kn ———_— 
noo x(x + 1)---(wW +n —2) 


> 


whence, since im 


360 (4+n— a =l, 


nin* 


(14.3) a ae RE | ee 
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Theorem 14.2. [fa function f (x) satisfies equation (14.2) and, moreover, 


lin, (EY Jaf =1 


then 
f@) =T@). 
Proof. Taking (in Theorem 5.8 in [554]) Q(x) = (2) = secbedin 
Ire tn) x-+n 
ijn 
(14.11) noo Je Fax(x + 1)---(e+a—1) 
Tetra tnt! aint 
Si. 
n—>0o nin*./x +n x(x +1)-+-(« +n) 
Further, 
ary Vee PPG ew _ ee tnt ng (ete 
nin*./x +n nij/x +n n 


In view of Stirling’s formula 
nl=n"e"J/2nne!/!" 0 <6 <1, 


expression (14.12) tends to 1 asm — oo. Thus f(x) = I(x) results by (14.11) and 
(14.3). 


Result 14.3. [f a function $(x) satisfies (14.2) with 6(1) = 1 and, moreover, 
(e/x)*b(x) is monotonic, then d(x) = T(x). 
14.1.1 Further Properties of the Gamma Function 


w(x) = 2*T (5*) r (56 + ») . 


The function y (x) is logarithmically convex in (0, 00); moreover, 


Set 


w(x +1) = 2*t!P (50 + ») r (5: + i) = xy(x). 


Thus w(x) may differ from I(x) only by a multiplicative constant, 
fi 1 
(14.13) 27 ha Tr ha + 1)) =dTQ), 


where d = [ (4) (Legendre’s formula). 
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Theorem 14.4. (Artin [80]). [fa positive function ¢ € C![(0, 00)] satisfies equation 
(14.2) and 


(14.14) 2 (5)() ado), @s20 (;). 


then d(x) = T(x). 
Proof. The function a(x) = log d(x) — log I(x) satisfies the equation 


(14.15) «(5) +a (=) = 000 forx > 0. 


Indeed, 


0 ($) +0(24) = tore) +e (4) 
—logT (5) —logT (=) 


by (14.14) and (14.13) =logdd(x) — log2* — dog dI' (x) — log 2x) 
= log d(x) — log F(x) = a(x). 


From [80, p. 38], it follows that a(x) = 0. Thus g(x) = T(x). 


14.1.2 Definitions of the Function I (x) 
For v,n € Z7., it is obvious that 


n! ee aad 


aa" eis 


: n+1n+2 n+v 
lim ——:-- ie 


n n n 


Since 


n> oo n n n 


one can have 
nin? 


(v—1)!= lim . 
noo )n+1 


Now treat v as a variable (positive) and define 
n!n* 
(*)n+1 , 


where x is a variable greater than zero. Then 


Vn (x) = 


In = ee ee) 
(ig Eine. = gt (logn-1-3-~-—7) 
(X)n41 


1 1 7 1 1\ + 
lim {1+=-+4+---+-—-—logn})=y = —-—fe dt 
n—>00 2 n 0 l-e“ ft 


is known as Euler’s constant. 
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Since the infinite product 


converges uniformly and the series 


> flee (1+ 5) 2] 


n=1 


converges absolutely and uniformly, one can define the function 


x 


(14.3) T(x) = lim = 


forx > 0. 
n—0o (X)n+1 


It can be shown that (14.14) converges in the interval (—k, —k + 1), k € Z%, soT 
can be defined by (14.3) for all values of real x except for the values 0, —1, —2,.... 
It follows from (14.14) that 


. ee ses 
T(x)=e?*-= — (Weierstrass formula), 
x ul ~ 
from which follows Euler’s formula (14.9). 
Remark. Replace x by x + 1 in (14.14) to get 
x+1 


-ni n 
n(x + ) @ + Dat x n(x) rarer as | 
so that P(x + 1) = xT (a). 


Let us now take up (14.8). Set a(x) = ['(~)U — x)sinzx. This function is 
defined everywhere except at x = —n, n = 0,1,2,..., and is periodic with period 
1, a(x + 1) = a(x), so that it can be extended onto the whole real axis. Moreover, 


CO 
aé€C!in U ~@—1,n). Since 
—0o 


sin 2X sin 7x 


=Ta4+)rd—-x) ; 
x 


(14.17) a(x) =xT@)Td — x) 


a(x) is of class C! in (—00, 00). By (14.13), 


1l-x Xx : 
T r(i-5)= 2-11 — x), 
(*)r(1-$) <a2tra 
whence 


“(a(S )="@ro-Das (AS) Bos 


d 1 d* 
= seh @)d2*'T0 —x)5 sin rx = Tue). 
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Consequently, the function ¢(x) = log <a(x) satisfies (14.15), whence it follows 
[80, p. 38] that a(x) = d?/4. Setting x = 0 in (14.17), we obtain a(0) = z, whence 
a(x) =a andd =2,/z. 

The equality a(x) = z also immediately gives the relation 


(14.8) T@)P0 —x) = 


sin ax 


discovered by Euler. Equation (14.8) is sometimes referred to as Euler’s functional 
equation. Now (14.13) holds. 

More generally, one can prove the relation (due to Gauss and known as Gauss’s 
multiplication formula) 


p =2,3,4,..., where dy = p'/2 (2a) 2-D/2, Whereas each of the equations 


p-l ‘ 
(14.18) v T1o(=“) =dyb(x), p=2,3,4,..., 
j=0 


has continuous solutions satisfying (14.2) but different from I(x), it may be proved 
that d(x) = I(x) is the only continuous solution of equation (14.2) fulfilling all the 
equations (14.8) [80]. 

The Stirling formula for T(x) is 


T(x) = M2 xt e—* +O (H)/ 12x 
where 6(x) fulfills 0 < O(x) < 1. 
Now we treat the complex versions of (14.8) and (14.18). 


Theorem 14.5. (Haruki [356]). Let S = {0, —1, —2,...} denote the set of all non- 
positive integers and K = C\S its complement. If 


(i) f is a meromorphic function of a complex variable z on C, 
(ii) f is analytic and vanishes nowhere on D, 
(iii) and f satisfies the functional equation 


NI 


n—1 age ‘ 
k n= Gauss multiplication 
(14.7a) nn Ils ( = ) = (22)"F f(z) ( P 
k=0 


n formula) 


z—i 
on K, where n is an arbitrarily fixed positive integer greater than | and n* 2 


denotes the principal value exp ((< = s) Log n) : 
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then the only function satisfying (i), (ii), and (iii) is 


feiss (« (: = 7) + an) r@), 
2: n—-1 


where a is an arbitrary complex constant and m is an arbitrary integer. 
~_1 
Equation (14.7a) characterizes 1 (x) by a multiplicative factor. Here n*~? de- 


notes exp ((< = s) Log n) , where Log is the principal value of the logarithm. 


Remarks. As is well known, the function f(z) = I(z) satisfies conditions (i), (ii), 
and (iii) of Theorem 14.5. When n = 2, (14.7a) becomes 


Zz 1 
(14,14a) 27-'T (5) T (S ) = /xI(z) (Legendre’s duplication formula). 


Proof of the theorem. Replace z by z + 1 in (14.7a) to obtain 


(14.19) wt TT (=) = (2n)"F f(c+l). 


n 
k=) 


Observing that, by condition (ii), f(z) does not vanish anywhere on K and dividing 
(14.19) by (14.7a) side by side yields on K 


Pt) Fes) 


14.20 2 
ale 2 f (2) zf (2) 
Set 

ft) 
14.21 Feasts 
( ) (z) f@ 


so that, by conditions (i) and (ii), F is meromorphic for |z| < +oo and analytic on 
K. Furthermore, by (14.20), we get on K 


(14.22) F (-) = F(z). 


Let zo be an arbitrarily fixed point belonging to K, and we set b = F(z). Then, by 
(14.22) and iteration, we have 


(14.23) F (4) =5 a ee ae 


Furthermore, by (14.21) and condition (ii), we obtain b 4 0. 
Since F is meromorphic for |z| < +00, the point z = 0 is a pole or a removable 
singular point of F. We claim that the point z = 0 is a removable singular point. 
Assume the contrary. Then the point z = 0 is a pole of F. If we set z, = + (k= 
1,2,3,...), then we obtain lim zz =0.Hence we have lim F(z) =~. 
k—> +00 k—+00 
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On the other hand, by (14.23), we get ae F (zx) = b, which is a contradiction. 


Therefore, the point z = Oisa ee a singular point. Hence, by Riemann’s 
theorem on removable singular points (see [67, pp. 124—125]), F is analytic on the 
point z = 0. If we apply the Identity Theorem with f(z) = F(z) and zo = 0, then 
we obtain F(z) =bonK U {0}. 

Thus, by (14.21), we get 


(14.24) f(z+1) = bef (2). 


Now we prove that the point z = 0 is a simple pole of f. By (14.24) and by the 
fact that f is analytic at the point z = 1, we obtain 


fet _ f@ 


(14.25) Tim (zf()) = me; oS 


By condition (ii), a0) is a nonzero complex constant. But then (14.25) shows that 0 
is a simple pole of f. 

We claim that the point z = —k (k € Z7.) is also a simple pole of f. Repeated 
application of (14.24) yields on K 


fztkh =e. +k-1(z +k —2(e+k-3)--- (2+ Izf(z) 


or 


f(z+k) 
big tk —-lM(c+k—2)(¢+k—3)---(+)z 


From (14.26) and (14.25) there results 


(14.26) f@= 


tim (+ H£@) = 1) lim (GtbfE+h) 


mm 
= (-1)' olin lim ( 2f (z)) 
x fQ) 


OD Fe 


#0. 
Thus, the point z = —k is a simple pole of f. Let 


ge) 


(14.27) gs) = re. 


Since f and I are analytic and ['(z) 4 0 on K by (14.27), g is analytic on K. 
The set of all poles of f and I is the same: S = {0, —1, —2,..., —k,...}. Further- 
more, each pole of f and IT is a simple pole. Therefore, by Riemann’s theorem on 
removable singular points, g is analytic at each point belonging to S, so g is an entire 
function of z. 
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Substituting f(z) = ['(z)g(z) into (14.19) yields 


n—-1 a 
(14.28) I«(= ) =e 


k=0 


for all complex z. 

Now g(z) is never zero on K by condition (ii). Furthermore, since each of 
=a (k = 0,1,2,...) cancelled in the numerator and the denominator, the entire 
function g(z) is never zero. Hence there exists an entire function ¢ of z such that 


(14.29) g(z) = exp(P(z)) 


holds for all complex z. 
By (14.28) and (14.29), we have, for all z € C, 


n—-1 a 
(14.30) oo D0 (= ) -#©))= 1. 


k=0 


Differentiating (14.30) twice with respect to z gives 


” 4 22! r ig 
sO=5>4 (= ). 
k=0 
Hk) <q 4h, ta4+h < t04 


Set m = Max(|"(z)|) for |z| < 1. Since a + a a 
(n—1))=1(K=0,1,2,...,n — 1) for |z| < 1, we obtain 


o (24) 


Io" (z)| 


<m (k=0,1,2,...,2—1). 


Thus we obtain 


IA 


<—. — 
<a:nm 


Hence we obtain (0 <) m < @ (n > 1) and som = 0. Therefore we have $”(z) = 0 
for |z| < 1. By the Identity Theorem, we get for all complex z 


(14.31) d(z) =az+b, 


where a, b are complex constants. 
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n—1 
Substituting (14.31) back into (14.30) and using >) k = 5n (n—1)(n > 1) yields 
k=0 


exp ((r —1) € +b)) =1; 


that is, b = =s + anit , where m is an integer. 
Hence, 
1 2mrt 
(14.32) $(2) =a (: = 7) 4 
2 n—-1 


From (14.32), (14.29), and (14.27), we get on K the result sought, 


1 2mni 
peymer(o(2-2)+ 222) re, 
2 n—Il 


This proves the theorem. 


Remark. Since the left-hand side of the functional equation (14.28) is a product, 
we may think of taking the logarithms on both sides. However, since the complex 
logarithmic function is multiple valued, we need (14.29). 


Corollary 14.5a. Under the additional conditions that f(1) = 1 and f'() = 
T’(1) = —y, where y denotes Euler’s constant, the only function satisfying (14.7a) 


is bf (z) =T(z). 
Corollary 14.5b. Under the additional conditions that f (3) = (3) = ./x and 


po (4) = [’ (4) = —/x(y + 2 Log 2), where y denotes Euler’s constant, the 
function satisfying (i), Gi), and (iii) is f(z) = T(z). 


Euler’s gamma function satisfies on K 


(14.8a) T(z)ld-z)= nee 


The formula above is called Euler’s functional equation. As an application of Corol- 
lary 14.5b, we shall prove (14.8a) later. 


Proof of Euler’s functional equation (14.8a). Let 


us 


MO= Ta asinges) 


and 
p(z) =Td —2z)sin(zz). 


We shall show first that p is an entire function of z. Since it is obvious that p is 
analytic on the complement of Z*%, we shall prove that p is analytic at z = m with 
p(m) # 0, where m is an arbitrary positive integer. 
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By using the fact that the residue of F at z = —k € Sis (—1)* a we obtain 


jim: P(zZ) = jim ((—z + 1) sin(zz)) 
jim ((—zP'(—z)) sin(z)) 
( sin(zz) 


—Z +m 


—m lim 
Lt 


) jim ((~z +m)I(—z)) 


(-m)(—1)""'x_ Tim (+ m)P(z)) 


1 
= (-m)(—1)"™'a(-1)"— 
mM. 

a 


air re 


Hence, by Riemann’s theorem on removable singularity, p is analytic at z = m with 
p(m) # 0. Thus p is an entire function, and the set of all zeros of p is S. Hence, y 
is meromorphic in |z| < -++oo and is analytic on K. Furthermore, y never vanishes 
on K, so y satisfies conditions (1) and (11). 

Next we shall prove that y(z) satisfies condition (iii) with n = 2. By Legendre’s 
duplication formula, we obtain on K 


2 


zh (Z z+l)\_4:-4 ci 
te) ) arama 


z 
2 2 


es 2, 
2(1-2)-1p (45) r (49+) (2 sin (3) cos (%)) 
— v2x? 


a 
- 20a — z) sin(zz) 


= (2x) y(z). 


Hence y(z) satisfies the functional equation 


(14.14a) ty (=) wy (= -) = (22) = w(z) 


on K. 

Furthermore, by T (3) = Jz, sin (5) = |, we get y (3) = © (3) ae 
y! (4) =.’ (3) . So, by Corollary 14.5b, we have y(z) = F(z) on K and (14.8a) 
holds. 
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14.2 Beta Function 


Definition. The function £ : R* x R4_ — R (a real-valued function defined in the 
first quadrant), monotonic in the first variable x for a fixed second variable y (enough 
to assume the monotonicity for large x) and fulfilling the conditions 


(14.33) Bat+l1,y)= yh y), x,y € (0,00), 
(14.33a) Bd,y) = - 

y 
is called Euler’s beta function. 


Theorem 14.6. (Kuczma [554, 549]). We have 


T(ax)I 
(14.34) Bi, y= ee forx,y > 0. 


Proof. Keeping y fixed, we have (by virtue of Theorem 5.4 in [554]) 


B(x, y) 


1 l+n xtnt+y 
a eas 3 x+n 


— im AMO + 4D @tyta-) 
nin® 
UD a i ee 
novoxtytn x@+) Gn) y+) (+n) 
(~ty)\@+y4+1)---@t+yt+n) 
, ninxt+y ° 


and the theorem follows in view of (14.3). 


Theorem 14.6a. [76]. The function f : Ri, x Ri. > R satisfying (14.33) and 
(14.33a) and logarithmically convex or deapasne for large x is the beta function B 
given by (14.34). 


Proof. One can, by using (14.33), write 


(14.35) fatny)= e mat f(x,y), forne Zi, x €]0,1L 
and 
(14.35a) f+, y) = ——. 

(Y)n+1 


Since f (x, y) is supposed to be decreasing for large x, 


fa,y)=>fatn,y)>fa+ti,y) forx €]0, 1). 
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Because of (14.35) and (14.35a), we have 
@=D!  _@& 


7 — lee 
(Y)n (x +y)n oa 1+] 
or 1 
(a—Il@+y)n ee 1x + Y)n 
(Y)n(*)n (x)n (Ona 
This inequality is valid for all n. Replacing n by (n + 1), one finds 
| 
n\(x + y)n+1 > f(y) > m(x+y)nt1 xX+n 
(Y)n41 )n41 (nti @)nt1 x + y+ n° 
and since a — lasn — o, we have 
y+n 
nlx + y)n+1 
f(y) = lim, ————*— 
oe nats 
n 4 nind 
_ Him & oa im. (Y)n+1 
~ li nintty 
n—->oo +Y)n41 
P()P(y) 
= ——— =A, 
T@+t+y) 


Remark. / given by (14.34) is symmetric. Differentiating partially with respect to 
x, we obtain 


P@ + y)'@) — P@)'@ + Yr 


T2@+y) M0). 


Bx, y= 


Since ao + is a decreasing function for x > 0, we have 


'@) ’@t+y) 
T(x) T(a+y) 
and since '(t) > Ofort > 0, x(x, y) < 0. 


In the same way, f(x, y) is decreasing for y > 0. x = 1 in (14.34) yields 
(14.33a). 


<0 (y>0), 


14.2.1 Integral Representation of 8 


One can represent / by an integral 


1 
(14.36) poss f Poa=—71) "at forx,y > 0. 
0 


Legendre called this integral Euler’s integral of the first kind or the beta function. 
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Theorem 14.7. [76]. 2 given by (14.36) is indeed (14.34). 
Proof. Replace x by x + 1 in (14.36) to have 


1 x 
per+try= (agers (+) dt. 


1—n Sil t x 
k= 1-1)" | ——} dt 
[ a-o (+) 


which by integration by parts gives 


a (1 —1t)**” aa 
7 x+y 1-t 


Set 


1—-y 


ie 


l—n dd = ey 1 x=1 1 
a rs — 9 
+f x+y 2 (ee ) ua? 


By allowing €, 7 — 0, we obtain 


1 
s644,9= — | (spe ae #, 
x+y Jo x+y 


which is (14.33). 
Set x = | in (14.36) to have 


1 1 
pa.y= (-9la=-, 


which is (14.33a). 
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The integral is a decreasing function of x for x > 0 since t € ]0, 1[. Hence, by 


Theorem 14.6 or 14.6a, / is given by (14.34). This completes the proof. 


Now I insert the nice, interesting writeup of gamma and beta functions from 


Dragomir, Agarwal, and Barnett [228] (see also [77]). 


Introduction 


In the eighteenth century, L. Euler (1707-1783) [261] concerned himself with the 


problem of interpolating between the numbers 


CO 
(14.4) n= | e't"dt, n=0,1,2,..., 
0 


with noninteger values of n. This problem led Euler, in 1729, to the now famous 
gamma function, a generalization of the factorial function that gives meaning to x!, 


where x is any positive number. 
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The notation ['(x) is not due to Euler, however, but was introduced in 1809 by 
A. Legendre (1752-1833) [604], who was also responsible for the duplication for- 
mula (14.3) for the gamma function. 

Nearly 150 years after Euler discovered it, the theory concerning the gamma 
function was greatly expanded by means of the theory of entire functions developed 
by K. Weierstrass (1815-1897). 

The gamma function has several equivalent definitions, most of which are due to 
Euler. To begin with, we define 

x 
(14.3) (@= in, 
noo x(x + 1)(x + 2)--- (x +n) 
If x is not zero or a negative integer, it can be shown that the limit (14.3) exists. It is 
apparent, however, that I(x) cannot be defined at x = 0, —1, —2,... since the limit 
becomes infinite for any of these values. 
By setting x = | in (14.3), we see that 


T(1) =1. 


Other values of I’ are not so easily obtained, but the substitution of x + 1 for x in 
(14.3) leads to the recurrence formula 


(14.2) Tae +1) =x (x). 


Equation (14.2) is the basic functional relation for the gamma function; it is in the 
form of a difference equation. 

A direct connection between the gamma function and factorials can be obtained 
from (14.2) as 


(14.1) r@+i<Snt,. #=0,1;2,..4 


Integral Representation 


The gamma function rarely appears in the form (14.3) in applications. Instead, it 
most often arises in the evaluation of certain integrals; for example, Euler was able 
to show that 


[o.e) 
(14.4) ra)= | er dk eS 0, 
0 


This integral representation of I(x) is the most common way in which the gamma 
function is now defined. Lastly, we note that (14.4) is an improper integral due to the 
infinite limit of integration and because the factor t*~! becomes infinite if t = 0 for 
values of x in the interval 0 < x < 1. Nonetheless, the integral (14.4) is uniformly 
convergent for alla < x <b, where0 <a<b<wo. 

A consequence of the uniform convergence of the defining integral for (x) is 
that we may differentiate the function under the integral sign to obtain [77, p. 54] 
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(oe) 
(14.4a) l'(x) =f Vas eeu logtdt, x > 0, 
0 


and - 
res | e't*ldogt)*dt, x>0. 
0 


The integrand in (14.4a) is positive over the entire interval of integration, and thus it 
follows that T(x) > 0; i.e., P is convex on (0, 00). 

In addition to (14.4), there are a variety of other integral representations of (x), 
most of which can be derived from that one by simple changes of variable (Andrews 
[77, p. 57]), 


1 1 x-l 
(14.5) rays | (ioe ~) du, x>0, 
0 u 


and 


TOE) = [cost '0sin?" 9.40, x,y > 0. 
20(«+y) 0 


By setting x = y= 5 in the integral above, we deduce the special value 


14.2.2 Other Special formulas 


A formula involving gamma functions that is somewhat comparable to the double- 
angle formulas for trigonometric functions is the Legendre duplication formula 


: 1 
(14.13) 2 Tr (: + 7) = /nT (2x), x>0. 
An especially important case of (14.13) occurs when x = n (n = O,1,2,...) 
[77, p. 55], 


1 (2n)! 
r(n+3) = an V™ n= 0.1, 2-62 
Although it was originally found by Schlémlich in 1844, thirty-two years before 
Weierstrass’s famous work on entire functions, Weierstrass is usually credited with 
the infinite product definition of the gamma function, 


CO 
eee no =xe*|] (1 = ) em, 


n=1 


where y is the Euler-Mascheroni constant defined by 


n 
. 1 
,= im. bs a en = 0577215 620. 
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An important identity involving the gamma function and sine function can now be 
derived by using (14.9a). We obtain the identity 


(14.8) T@)rd-x)=- (x noninteger). 
sin 2x 
The following properties of the gamma function also hold [77, p. 63-65]: 
CO 
T(Qx)= sf gota, x,s >0; 
0 
CO 
roy exp(xt — e')dt, x > 0; 
j oO (= 1)" 

14.4 T = e 't* ‘dt > 0; 
ada) ray= fe poem, : 

T(x) = (logoy* [ Pad, x0. bs 

0 
T@) =I’'@+1)-xI'(), x >0; 
[o,@) 
T(x) = e'(t —x)t*!logt dt, x0; 
0 
1 —1)"27"-!(n — 1)! 
re eee rc cal 2 n=0,1,2,...3 
2 (2n — 1)! 


T an r : = (-1)" =0,1,2 : 
7 n Y nyjy= 7, N=V,1,4,.-45 


Tx) = 13-31 r(: =) r( 4) 0: 
= = 2x) fxt+ 3 x+ ra x > 0; 
(l’(x)? <T@)r’ (x), x>0. 


Beta Function 


A useful function of two variables is the beta function [77, p. 66], where 


1 
(14.36) B(x, y) al fe d=¢y dt, 2S 0, 9S 0, 
0 


The utility of the beta function is often overshadowed by that of the gamma function, 
perhaps partly because it can be evaluated in terms of the gamma function. How- 
ever, since it occurs so frequently in practice, a special designation for it is widely 
accepted. 

It is obvious that the beta mapping has the symmetry property 


B(x, y) = BY, x), 
and the following connection between the beta and gamma functions holds: 


P@)rQ) 


» x>0,y>0. 
T(x+y) sf 


(14.34) By) = 
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The following properties of the beta mapping also hold (see, for example, 
[77, p. 68-70)]): 


Bat+lLy)+8@,y+)D=G@.y), x,y > 0; 
(14.33) BG, y+) =28@+1y)=——fG, y), x,y > 0; 
x x+y 
B(x, x) =2!-?*B («.5). x > 0; 
BR, y)Pa+y, 2Ba+y+zZ,w) 
— T@)TO) rw) x asa 
~ Ta@t+tytztw)’ ieee , 
p (SPS?) = seo (HE). O0<p<l:; 


i feta 
H==) — 
B(x, y) 7f, Gap 
1 ,x-1 y—1 
. wae t*—*(1 —t)y 
= p+ a ——~ dt forx,y,p > 0. 
p (1+ p) b+ PD y,P 


14.3 Riemann’s Zeta Function 


The formula 


[ee 


(14.37) Oa) a 


n=1 


defines the function ¢(x) for Re x > 1. This function can be continued onto the 
whole complex plane except at x = 1, where it has a simple pole; thus it is a mero- 
morphic function. ¢(x), known as Riemann’s zeta function, satisfies the functional 
equation 


(14.38) c(1 — x) = 2(2)-* cos TeX), 


where I(x) is Euler’s gamma function. In view of relations (14.16) and (14.8), with 
x replaced by (1 — x)/2, (14.38) is equivalent to 


C(x)T (5) qt? = ¢(1— x) (=) g@-D/2. 


The function ¢(x) plays an important role in number theory. 


Remark. The first satisfactory proof of (14.38) was given by B. Riemann in 1859, 
but the equation itself (called the Riemann equation) together with a motivation was 
already given by Euler in 1768. 


Equation (14.38) is to a great extent characteristic for function (14.37). Namely, 
we have the following. 
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Result 14.8. (Hamburger [338], Kuczma [554], Hecke [362]). Let #(x) be analytic 
in every finite domain except at most a finite number of poles, and suppose that there 
exist positive constants r and wt such that 


Ip(x) < el" for |x| =r. 


Further, let (x) be expansible for Re x > 1 in an absolutely convergent Dirichlet 


series 
[oe] 


(x) = Do ann™, 


n=1 


and let the function y(x) defined by 


yw — x) =2(7)™ cos (xx) T'(x)A(x) 


be expansible in a Dirichlet series 


CO 
y(x) = Sie 
n=1 


absolutely convergent on a vertical line Re x = ug > 0. Then (x) = w(x) = 
cC(x), where c is a constant. 


14.3.1 The Theta Function 


Given functions 7, @ : R” — C, define the subspaces V(y, ¢) and W(x, ¢) of the 
complex vector space (R”)© of all complex-valued functions defined on R”. The 
space V(y, ) is the span of all functions ur y(u+v)¢(u —v) with fixed v € R”, 
and W(y, ¢) is the span of all functions v  y(u + v)d(u — v) with fixed u € R”. 
Note that Viv.) = Vig,y). If y = ¢ = @, we simply write V(@) for V(@,@) and 
W(6) for W(@, 4). Identifying the space C” with R?”, we can also use this notation 
for functions y and ¢ defined on C”. 


Definition. Let u = (€|,6,...,é,) € C”. An analytic trivial theta function is a 
function y : C” — C that can be written as 


wu) =cexp(B(u) + a(u)) foru eC", 


where c € C* is a constant, a : C” > C, given by 


n 


alu)= > ag, a €C, forv e {l,...,n}, 


v=1 


is a C-linear mapping, and £6 : C” —> C, given by 


n 
BU) = Do bruSiSus boyy €C, for v, uw € {1,....n}, 


v,g=1 
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is a complex-valued quadratic form where u = (€1,é),...,¢,). Alternatively, the 
analytic trivial theta functions are exactly the functions of type u F exp(P(u)), 
where P : C” > Cis ann-variable polynomial with complex coefficients of degree 
at most 2. 


Equations of the Type (14.39) 


(14.39) xU + v)PU — v) = fiW)giC.) + fro)g2(0), 


where y, ¢, fi, f2, 21, 82 are complex-valued functions on R", n € Z% arbitrary, 
and y and ¢ are continuous is well known in the theory of theta functions of one 
complex variable. There they are called addition formulas for theta functions. Apart 
from degeneracy and some obvious modifications, theta functions are the only con- 
tinuous functions satisfying an addition formula of type (14.39). 


Result 14.9. (Bernoulli [121]). Suppose y,@ : R" — C are continuous functions 
for which there exist functions f\, f2, £1, g2 : R" — C so that (14.39) is valid for 
allu,v € R". If y 4 Oand ¢ #0, then we have the following possibilities for the 


pair x, @: 


(al) x) = ci exp(BWu) + a1(¥))Gi + au), 
p(u) = cr exp(B(u) + a2(u)), 

(a2) xu) = ci exp(B(u) + a1 (u)) sin(@i + a(u)), 
p(u) = crexp(Blu) + a2u)), 

(bl) x(u) =crexpB) + aw) + alu)), 
p(u) = crexp(Bl) + a2(u))(C2 + a(u)), 

(62) x(u) = cr exp(Bu) + a1 (u)) sin + au), 
p(u) = crexp(BlU) + a2(u)) sin(d2 + a(u)), 

(63) x(u) = cr exp(B) + ai))o(G + atu)), 
p(u) = crexp(Bl) + a2(u))o C2 + alu). 


Here c1,c2 € C*, 1,62 € C ave constants, a1,02,a : R” — C are R-linear 
mappings, B : R” — C is a complex-valued quadratic form, and o is a Weierstrass 
o -function. 

In cases (a1) and (a2), the roles of x and ¢ may be reversed. Conversely, each of 
the listed function pairs satisfies the functional equation (14.39). 


14.4 Singular Functions 


A nonconstant g : [0, 1] — [0, 1] is called (strictly) singular if it is continuous and 
(strictly) increasing with g’ (x) = 0 a.e. Singular functions are not absolutely contin- 
uous. We consider now three singular functions associated with Cantor, Minkowski, 
and de Rham [412]. 
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14.4.1 Cantor-Lebesgue Singular Function 
Cantor’s ternary set W is obtained by removing from [0 


1 
12 12 7 8 Ie 32 
(5.3), er= (33), 22= (5.8), bas (aH) and 


= 2Xn 
Waar tn 0,0) 


—— 


n=1 


The Cantor function C : [0, 1] — [0, 1] is defined by 


a 5 2tn San Rie !. § : 
pag — a an (x) := ri orx € Ink. 


n=1 


C was introduced in 1884 by G. Cantor [141], who had already stated its singularity. 
The function C later appears in H. Lebesgue’s classic book on integration [602] and 
is referred to by some authors as Lebesgue’s singular function. 

C satisfies 


1 1 
1 
| C'(x)dx =0, i C(x)dx = =, 
0 0 2 
1 
| Vv14+(C’(x))?dx =1,  arclength (graph C) = 2. 
0 
Sierpinski [739] showed that C satisfies the functional equations 
x 1 x+1 1 
t (5) = 5/0). | 5 )=3 


(EF) = sre+5. ret, 


(14.40) 


and proved the following characterization: Assume that f : [0,1] — R is bounded 
and satisfies (14.40). Then f = C. (See also [554].) 


14.4.2 Minkowski’s Function 


H. Minkowski introduced the function K : [0,1] — [0,1] in his Geometrie 
der Zahlen [636]. The aim was to get a geometrical picture of Lagrange’s cri- 
terion for real quadratic irrationals. This function was in fact a bit mysterious, 
and even now some of its properties are hard to obtain. K is defined as follows. 


K(0):=0, K():=1. ie and B are two adjacent Farey fractions (pq'—qp' = 1), 


then 
/ 1 / 
« (7) = a1 (2) +*(7)], 
q+q' 2 q q’ 
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For example, 


K : — 1 KE Ka a2 
(5) =5! y+ (DiS: 
1 


K (<) =5 Ko) +K (=) = ; 


The remaining values are obtained by continuous extension. 

K is a strictly singular function. The connection with Lagrange’s criterion “‘x is 
a quadratic irrational iff its simple continued fraction is periodic” is as follows. Let 
x € (0, 1) be given by a simple continued fraction [0, aj, a2,...], ay € X. Then 


K(x) = Sept igiartart a), 
n=1 
Thus we have the equivalent statements 
(1) [0, a1, a2,...] finite, (ii) x rational, (aii) K (x) dyadic rational 
and 
(1) [0, a1, a2,...] periodic, (ii) x quadratic irrational, (iii) K (x) nondyadic 
rational. 


A first hint on functional equations for K can be found in the 1956 paper of 
G. de Rham [702]. There he considers the system 


x f(x) x+1 
14.41 (=) Sipee na ee =f(x)+1, xe€[0,)), 
( ) Pls T+ f@) (SE )-ro x € [0, 1) 
which has a unique positive solution m; m is continuous and strictly increasing with 
lim, m(x) = co. 

K satisfies 


x 1 
(14.42) 2K (=) = K(x), 2K (—) =K(x)+1, x € [0,1]. 


The most comprehensive facts on functional equations and characterizations for 
K has been given by R. Girgensohn in [313]. He showed that K satisfies the func- 
tional equations 


x 1 
(14.43) f (- = :) = 5F@), x € [0,1], 
(14.44) f@w+fd-x*)=1, x € [0,1], 
1 1 
(14.45) f (—) =l|1- 5 F&); x € [0, 1], 


1 | | 
(14.46) (=) 5 5 FO); x € [0,1], 
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and proved the following characterizations: 


Assume that f : [0,1] — R is bounded and satisfies (14.43) and (14.45). 
Then f = K. 


Assume that f : [0,1] — R is bounded and satisfies (14.43) and (14.46). 
Then f = K. 


Moreover, he showed that (14.43) and (14.45) together imply (14.44) and (14.46) 


but that there are unbounded solutions of (14.43) and (14.46) that do not satisfy 
(14.44). 


In proving the functional equations and the characterizations, continued fractions 
are essentially used. 


14.4.3 De Rham’s Function 


G. de Rham proved in [702] the following theorem: Fix a € (0,1). Then there is 
exactly one bounded Rg, : [0,1] > R that satisfies the functional equations 


1 
(14.47) ff (5) =af(x), f (= ) =(l—a)f(x)+a, x€[0,1]. 


R, is strictly singular for a # 5; R1/2(x) =x. Rg is given by 


= So /1—a\" 
R,(0) = 0, re(S2-")=>( - ) 


n=0 n=0 


YO<YL<y2<-++<m EN. 


The main property of Rz, its strict singularity, was proved by de Rham essentially 
using the defining functional equations, not its representation given above. He further 
provided a probabilistic interpretation of his system (14.47). De Rham was aware of 
the fact that R, had already been investigated by E. Cesaro in [148], etc., but without 
the use of functional equations. 

It is amazing that the strictly singular R, has applications in real life. Let f(x) 
be the probability of eventual success under bold play for a gambler with initial 
fortune x and a@ the probability of winning a turn. Then f/f satisfies (14.47), and 
hence f = Rg. 

Furthermore, R,, was used by B. Bernstein, T. Erber, and M. Karamolengos [126] 
to describe certain aspects of the behaviour of plastics. 

L. Takacs in [800] introduced the functions T, : [0,1] > R, p € (0, 00), given 
by 


T,(0):=0, 1, 6 >) = kate, 
k=0 


k=0 
Vo <Vp <2 <-::<yeEN. 
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He proved that 7; is strictly singular for p 4 1 using the representation given above. 
It is easy to check that T;, satisfies the functional equations 


£(3)=() re. 


x+1l\_ p 1 
r( 5) )- row, x € [0, 1], 


(14.48) 


and is its unique bounded solution. In fact, T, = Ri/(i+p) for p € (0,00). We 
add the observation that R, = R, — a satisfies the single replicativity equation 


f£() + fA) = fF) on (0, 1]. 


15 


Miscellaneous Equations 


This chapter is devoted to solutions by the methods of determinants and means and 
revisits logarithmic, trigonometric, and polynomial (mean value theorem) functions 
and inner product spaces. 

We start off with a method known as the method of determinants, due to 
E. Vincze, and solve many of the equations solved before. 


15.1 A General Method: Method of 
Determinants 


E. Vinceze [818] has introduced a method of solving functional equations applicable 

to a rather large class of equations. It is based essentially on the notion of linear 

dependence and its expression in terms of identities involving determinants. 
Consider the determinant 


Fy (z1) Fo(z1) «++ Fn(z1) 
F\(z2) Fo(z2) «++ Fn(z2) 


F\ (Zn) Fon) +++ Funan) 


consisting of complex or real functions defined in some arbitrary subset S of R or C. 
We can abbreviate this by 


A[Fi(z1), F3(z2), cae Fi(Zn)] = A[Fi, Fo, «83 07D Fy]. 


Some Properties 


Suppose that the n(k + 1) functions F; : S > C @ = 1 ton(k + 1)) satisfy the 
identity 


k 
(15.1) > A[Fynti (21)5 Font2(Z2)5 -- +5 Pontn (Zn)] = 0 
v=1 
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for all z1, Z2,...,Zn in S. Then 
k 
(15.1a) > A[Fyon+1(1), Fyn42(Z2), tees Fyntn@n)s Fo(n+1)] =0 
v=l1 
also holds for all values of z1, Z2,..., Zn, Zn+1 in S for an arbitrary Fo. Vincze calls 


the passage from the first equation to the second “extension or expansion by Fo(z)”. 
It is the basic tool in his discussion (see also [12]). 

Let us further note that F;, Fo,..., F, from S to C are linearly independent in 
S CC; that is, there exists cj to cy not all zero such that 


n 
> ah© =0 
i=1 
if and only if 


(15.2) AL[Fi (21), F2@2),---, Fn@n)] = 0 


for all z}, Z2,..., Zn in S [12, 818]. 


Remarks. If the functions F|,..., F, are linearly dependent, one of the following 
holds: 


(i) F,(z) = 0 for all z € C. 
(iit) Fe(z) = best Feai(z) +--+ + bn Fn(z), where the by (kK = 2,...,n) denote 
complex constants. 


We note that in equation (15.1) we can introduce a parameter w, and it follows 
from 
A[Fi(zi + w), F2(z2),---, Fn@n)] =0 


that 
bi (w) F(z + w) +--+ + bn (w) F(z) = 0 


for all z,w € C. 
From this it follows as above that at least one of the following equations holds: 


(i) F,(z) = 0 forall z € C. 
(ii) Fy(z) = bei Fra (z) aE seo a bn Fn(Z) (k =2,..., n). 
(iit) Fy(z + w) = b2(w) F2(z) ++ ++ + bn (w) Fy (z), where by (k = 3,...,n) denote 
complex constants and the by(w) (k =2,...,m) are complex functions. 


These remarks will be exploited in what follows. 


Vincze illustrates his method with a number of examples. We will illustrate a 
couple of them. The first is one of the simplest. 


Example 1. It is desired to find functions F, G, H, K, N satisfying the equation 


(15.3) F(z1 022) = G(z1) + A (z2) + K (21) N(Z2) 
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for z1, Z2, Z1 © Z2 belonging to a set S (see Chapter 1, (1.94a,b)) that is an Abelian 


6699 


semigroup with respect to the operation “o”. In particular, it is assumed that there 
is a point a € S such that the equation a o z = c has at least one solution for every 
given c. 

Since the left side of (15.3) is symmetric in z; and zo, 


G(z1) + H(@2) + K(Z1)N(Z2) = G(@2) + A(Z1) + K@2)N (1), 
or in determinant notation 
(15.4) A(G,1)+ AC, H)+ A(K, N) =0. 
Now we “extend by 1” to obtain 
A(G, 1,1) + AQ, H,1)+ A(K, N, 1) =0 


and then 
A(K, N, 1) = 0; 


that is, the functions K, N, 1 are linearly dependent. Then, by the remark, there are 
two possibilities, either 


(QN@)= fi 
or 
(2) K(z) = Bi + BoN(Z). 
We shall discuss case (1). Substituting in (15.4), we obtain 


A(G, 1) + AC, H) + ACK, fi) = ACG, 1) — ACA, 1) + A(A1K, 1) 
= A(G—H+iK,1)=0. 


Thus there is a constant /2 such that 
G(z) — H(z) + fi K(@) = 2fr. 
Combining, we get from (15.3) 
F(Z: 022) = [G(z1) + Bi K @1) — £2] + [G(Z2) + Bi K @2) — Aa], 


which is the Pexider equation (PA), (PL) considered in Chapter 1. Thus the final 
solution in case (1) is given by 


FQ) = AZ) +2f3, Gz) = AZ) — Ai K(@) + fot fs, 


(15.4a) ; 
ee =A) + f3— fo, K(z) arbitrary, N(z) = fi, 


where A(u ov) = A(u) + A(v). 
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Case (2) is more complicated and also involves the solution of another Cauchy 
equation, 
E(Z1 022) = E(@1)E(z2). 
Aside from this, no new principle is involved. We briefly indicate how the proof 


goes. 
Substituting (2) in (15.4) yields 


A(G,1)+ AQ, H) + A(Ai + BN, N) = A(G — H — BiN, 1) =0; 
that is, 
G(z) = H(z) + BiN + fs, 
which in (15.3) gives 
F(Z) 022) = G(z1) + G2) + 2N(@1)N (22) — As. 
Using associativity of o, we get 
A[G(z1 0 w), 1] + Al], G@2)] + AIN(G1 © w), f2N(@2)] = 0. 
Expanding by | results in 
A[N(Z1 0 w), P2N(Z2), 1] = 0. 
Now apply the remark a couple of times. The remaining solutions of (15.3) are [818] 
F(z) =a E(z)+ A(z) + a, 
G(z) = a3E(z) + A(z) + aa, 
(15.4b) H(z) = asE(z) + A(z) + G6, 
K(z) = a7E(z) + a8, 
N(z) = agE(z) + a10; 
F(z) = a A(z)? + a2A(z) + Ai) + 23, 
G(z) = a A(z)* + Ai (z) + a4, 
(15.4c) H(z) = a A(z)? + asA(z) + Ai(z) + 6, 
K (z) = 2a, A(z) + 47, 
N(z) = A(z) + as, 
where the constants a; satisfy certain conditions, and A and E are as given above. 
Vincze has also applied his method to generalized sine and cosine equations 


where addition is replaced by an associative operation o. The method appears to 
be quite powerful. 


Example 2. We consider now the “trigonometric” functional equation 
(15.5) C(z + w) = C(z)C(w) — S(z)S(w), 


where C,S : C — C (see Chapter 3, (3.72)). We shall determine the most gen- 
eral complex solutions of this equation by using the method of determinants in the 
following theorem. 
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Theorem 15.1. [818]. Suppose C, S : C > C satisfy equation (15.5). Then C and 
S are given by 


1 1 
(cl) C@)= ap le + P)E2(2) — (a - AE1@)), S@= ap B®) — Ex(z)), 


with a2 — 6? = 1, B £0, 


(c2) C(z) = E(Z)A4A(z)), S(z) = E(Z)A(z), 


where A is additive, E, and E2 are exponential (E), and a and f are complex con- 
stants. There are no other solutions. 


Proof. We consider solutions c,; and cp. It is clear that with the mentioned restric- 
tions on the constants a and /, these functions actually do satisfy equation (15.5). 
Therefore, we need only show that all solutions are either of the form (c1) or (c2). 
The idea is first to obtain a formula for S(z + w) similar to C in (15.5). 

By the associativity of addition +, we have 


Cz+¢4+w)=C(e+0)C(w) — SZ + O)S(w) 
=C(E + w)C(z) — SE + w)S(z), 


from which we obtain 
(15.6) A[C(Z + ¢), C(w)] — AIS@ + 2), S(w)] = 0, 
which by “enlarging” by C(u) we obtain 
A[SZ + ¢), S(w), C(u)] = 0. 
Hence we have by the remark one of the following cases: 


(i) C(u) = 0. 
(i) S(z) = 61C(z) (fi constant). 
(ii) S@ +) = a1 (C)S(Z) + a2(C)C(). 


(i) If we set C(z) = 0 in (15.5), then S(z1)$(z2) = 0, and hence $(z) = 0. This 
solution is contained in both c; and co. 
(ii) In this case, we obtain from (15.5) the Pexider functional equation (PL) 


C(z+w) = (1— BC): C(w). 


Thus, with 1 — 27 40, 


il 
CZ) = fp. 
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where E} is exponential, and then from (ii), 


Ai 


1-£ 


S(z) = E\(z), 


which gives (cl). 
We should stress that if C and S are going to be linearly dependent, either 
(i) or (ii) must then hold. We therefore assume in what follows that they are 
linearly independent. 
(iii) We notice the symmetry of the left side of (111) and see that 


(15.6a) A(a1, S) + A(az2,C) =0. 


“Enlarging” by C yields 
A(a1, S,C) =0. 


But this equality can only hold (see remark) if 
(15.7) ai(z) = BiS(z) + f2C(z) (Ai, fe are constants) 


since we supposed the functions S, C to be linearly independent. 
If we use (15.7) in (15.6a), we obtain 


A(f,S + foC, S) + A(az, C) = A(foC, S) + A(az, C) 
= A(—f2S, C) + A(a2, C) 
= A(a2 — f2S,C) = 0. 


Since C 4 0, we have 
(15.8) a2(z) — B2S(z) = £3C(z) (3 constant). 
Making use of (15.7) and (15.8), we obtain from (iii) 


S(z + w) = a (w)S(z) + a2(w)C(z) 
= [f2S(w) + B2C(z)]S(z) + [B2S(w) + f3C(w)C(z) 
(15.9) = Bi S(z)S(w) + foS(z)C(w) + B2S(w)C(z) + f3C(z)C(w), 


a formula similar to (15.5) for S. 
In order to obtain further restrictions on the constants £1, f2, £3, we set (15.5) 
and (15.9) into (15.6): 


A[C1)C(C) — S(Z)SC), C(w)] — ATA S@)SO) 
+ B2S(Z)C(C) + B2S()C(]Z) + BCC), S(w)] 
= C(C)ATC(Z), C(w)] — S)ATS@)CWw)] 
— A SC)ATLS(Z), S(w)] — PaC(Q)ALS(Z), S(w)] 
— B2SC)AIC(Z), S(w)] — BCC)ALC(), Sw)] 
= [0 — f2)S() — ABC(C)AIC(Z), S(w)] = 0. 
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Because C and S were assumed linearly independent, we have 
A[C(Z), S(w)] #0 


and hence 
(1 = f2)S(C) — AsC(@) = 0, 


from which it follows since S and C are independent that £2 = 1, £3 = 0. Now 
equation (15.9) can be written as 


(15.10) S(z + w) = fi S(z)S(w) + S(z)C(w) + S(w)C(Z). 

Now we determine y so that 

(15.11) C@+w)+yS@+w) = (C@)+yS@))(CWw)+ySW)), zwec. 
Utilize (15.5) and (15.10) in (15.11) to have 


C(Z)C(w) — SZ)S(w) + y Bi SZ)S(w) + y S(Z)C(w) + y S(W)C(Z) 
= C@C(w) + yC@S(w) + y S@)CW) + y7S@)S(w); 


that is, since S £0, y? — yf} +1 =O0or 


_hitybi-4 


? 2 


Case i. Roots are distinct, say r}, r2. Let 
E;(z) =C@)+7S8@), i=1,2. 


Then the £;’s are exponential. Putting r equal to), r2 in (15.11) and subtract- 
ing one from the other, we get 


(r) — r2)S(z + w) = E,(z)Ei(y) — E2(z)E2(y) 


or 


S@)= (E1(@) — Ex@)), 11 #12. 


r= 12 


Use 
C(z+tw)trnS(z+w) = E,(z)E\(w) 


and an expression for S to get 


C(z) = 


[r) E2(z) — r2E\(z)], 
r—r2 


which gives (cl). 
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Case ii. Roots are equal. Then 6; = +z andr = +1. Now (15.11) becomes 
E(z+w) = E(z)E(w); that is, E is exponential, where E(z) = C(z) + S(z). 
From (15.9), we have 


S(z+ w) = S(z)E(w) + S(w) E(z); 


that is, 


SZ+w)  S@) z S(w) 
E(z+w)  E(z) E(w) 


and 


S(z) = E(z)A(z), 


where A(z) = a 


C(z) = E(z) = S(z) = E(z)C FAG)), 


and is additive. Finally, 


which gives (c2). 


Some of the other equations solved by this method include 


S(z + w) = S(z)C(w) + S(w)C(z), 
C(z + w) = C(z)C(w) + S(z)S(w), 
S(z+w) = G(z)H(w) + K (z)N(w), 
where S,C,G,H,K,N:C-C. 
Vincze also proved the following result (see also Chapter 1). 


Result. (Vincze [819]). Let S(«) be an Abelian semigroup. Then the functional 
equation 


f@ *y)" =[f@) + FO)" 
is equivalent to f (x * y) = f(x) + f(y), where f :S—> C. 


15.2 Means 


Now we devote this section to various means and the functional equations connected 
with them. We start off with the weighted arithmetic mean (w.a.m.), 


ax + by 
“am. = =(1- ; 
w.a.m ae (1 — p)x + py 
forx,y €R, a,bER, 0O<p<1, p=. 


Note that a = | = b yields the arithmetic mean 
x+y 
= 


A.M. = 
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Theorem 15.2. (See also [12, 44].) The only solution of the transitivity equation 
(15.12) Fat+t,yt+t)=F(x,y)+t, x,y,t ER, 
and the homogeneity equation 
(15.13) F(xu,yu)=F(x,y)u, x,y,ueER, 
is the w.a.m. where F: Rx R= R. 
Proof. Set y = 0, x = 1 in (15.13) to get 
(15.14) F(u,0) = F(,0)u = (1— p)u, paconstant. 
Put ¢ = —y in (15.12) and use (15.14) to have 


F(x,y)=Fa—y,0)+y 
=(1-p)@-y)+y 
= (1— p)x + py, 


which is w.a.m. This proves the theorem. 


Remark. Note that the pair of equations is solved without assuming any regularity 
condition on F. 


A considerable amount of literature (see [623]) about the concept of the mean 
(or average) and the properties of several means (such as the median, the arithmetic 
mean, the geometric mean, the power mean, the harmonic mean, etc.) was produced 
in the nineteenth century and often treated the significance and the interpretation of 
these specific aggregation functions. 

Cauchy [147] considered in 1821 the mean of n independent variables 
X1,...,X, as a function M(x1,...,%,), which should be internal to the range of 
the x;’s values: 


(15.15) min{x1,...,%,} <M(xq,...,%) < max{x1,...,Xn}. 


The concept of the mean as an average is usually ascribed to Chisini [153]. The 
solution of Chisini’s equation corresponds to the arithmetic mean, the geometric 
mean, the quadratic mean, the harmonic mean, the exponential mean, and the anti- 
harmonic mean, which is defined as 


2 
Dui %j 
Dai Xi 
Unfortunately, Chisini’s definition is so general that it does not even imply that 


the “mean” (provided there exists a real and unique solution to the equation above) 
fulfills Cauchy’s internality property (15.15). 


(15.16) M(x1,..-5Xn) = 


572 15 Miscellaneous Equations 


In 1930, Kolmogoroff [543] and Nagumo [642] considered that the mean should 
be more than just a Cauchy mean or an average in the sense of Chisini. They de- 
fined a mean value to be an infinite sequence of continuous, symmetric, and strictly 
increasing (in each variable) real functions 


Mi (x1) = 1, M2(x1, x2), ..-, Mn (01, .--5.%n),--- 


satisfying the reflexive law, M,(x,...,x) =x for all n and all x, and a certain kind 
of associative law, 


Mx (x1, ..-,Xk) =x 
(15.17) => Mn(xX1,..-, Xk Xk41,-+-5Xn) 
= Mal, vee XS XK+1, ioe Nn) 


for every natural integer k < n. They proved, independently of each other, that these 
conditions are necessary and sufficient for the quasiarithmeticity of the mean; that is, 
for the existence of a continuous strictly monotonic function f such that M, may be 
written in the form 


1 n 
(15.18) Maltiy...%n)=f'|— Df], ne Zh. 
i=l 


The quasiarithmetic means (see also Chapter 17) comprise most of the algebraic 
means of common use and allow one to specify f in relation to operational condi- 
tioning: 


Examples of quasiarithmetic means 
f (x) Mn(X1,.--5 Xn) 
arithmetic 


quadratic 


geometric 


harmonic 


i. 
x* (a € Ro) (4 Day power 


e“* (a € Ro) tlog| +e | exponential 


The properties above defining a mean value seem to be natural enough. For 
instance, one can readily see that, for increasing means, the reflexivity property is 
equivalent to Cauchy’s internality (15.15), and both are accepted by all statisticians 
as requisites for means. 
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Associativity of means (15.17) was introduced first in 1926 by Bemporad [116, 
p. 87] in a characterization of the arithmetic mean. Under reflexivity, this condition 
seems more natural, for it becomes equivalent to 


Mx(x1, i: 1.5 Xk) = Mx(x}, are 25 hp) 
J 


/ ‘ 
Min (1, «.-s Xks Xk415 ++ Xn) = My (x1, - 6-5 XpsXk+1s ++ +5 Xn), 


which says that the mean does not change when altering some values without mod- 
ifying their partial mean. This property is called “decomposability” in order not to 
confuse it with the classical associativity property. Some means, however, do not be- 
long to this family: The antiharmonic mean (15.16) is not increasing in each variable, 
and the median is not associative in the sense of (15.16). Kolmogoroff and Nagumo 
proved that the quasiarithmetic means are exactly the decomposable sequences of 
continuous, symmetric, strictly increasing in each variable, and reflexive functions. 

Observe, however, that this concept has been defined for symmetric means. 
When symmetry is not assumed, it is necessary to rewrite the decomposability prop- 
erty in such a way that the first variables are not privileged. Marichal et al. [624] 
proposed the general form 


Mei, «+ %) =% => Mn | Do miei + Do miei 


ieK igK 
(15.19) 
= Mn di xe + >) xe: 
ieK idK 
for every subset K = {i1,...,i¢} C {1,...,n} with i) < --- < i, (e; represents 


the unit vectors of R”). Of course, under symmetry, this definition coincides with 
(15.17). 

We now show that symmetry is not necessary in the Kolmogoroff-Nagumo char- 
acterization, provided that decomposability is considered in its general form. Thus 
we show that any decomposable sequence (M;,),<N, Of continuous, strictly increas- 
ing, and reflexive functions is a quasiarithmetic mean value (15.18). 

The characterization of the quasiarithmetic mean was done by using some of its 
simple properties. 

We first show that, for any decomposable sequence (Mn )neN, Of reflexive func- 
tions, the two-place function M2 fulfills the bisymmetry functional equation 


(15.20) M2(M2(x1, x2), M2 (x3, x4)) = M2(M2 (x1, x3), M2 (x2, x4)). 


The bisymmetry property (also called mediality) is very easy to handle and has 
been investigated from the algebraic point of view by using it mostly in structures 
without the property of associativity—in a certain respect, it has been used as a 
substitute for associativity and also for symmetry (see Chapter 15). 
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Proposition 15.3. (Marichal [623]). For any decomposable sequence (Mn)neNo of 
reflexive functions, we have for allk,n € Zi, (n = 2): 


(1) Min Qi, 65 X1ks - 6S Xnty +e Xnk) = Ma (Meri, ...5 X1k)) 5 
Mk (Xnt, ++ +5 Xnk))- 
(2) The function M2 fulfills the bisymmetry functional equation (15.20). 


Proof. We have successively 


M2(M2(x1, x2), M2 (x3, x4)) 
= Mg(x1, x2, x3, X4) (by (1)) 
= M4(M2(x1, x3), M2 (x2, x4), M2(x1, x3), M2(x2, x4)) (decomposability) 
= M2(M2(M2(x1, x3), M2 (x2, x4)), 
M2(M2 (x1, x3), M2 (x2, x4))) (by (1)) 
= M2(M2(x1, x3), M2(x2, x4)), (reflexivity) 


which proves the result. 


Now, consider a decomposable sequence (Mp )ne zt of continuous, strictly in- 
creasing, and reflexive functions. Since Mp? fulfills the bisymmetry equation, it must 
have a particular form. Actually, it has been proved by Aczél [12] that the general 
continuous, strictly increasing in both variables, reflexive, and symmetric real solu- 
tion of the bisymmetry equation (15.20) is given by the quasilinear mean. 


Result 15.4. [12]. Let I' be any real interval, finite or infinite. A two-place function 
M :I' x I' + Ris continuous, strictly increasing in each variable, reflexive, and 
symmetric and fulfills the bisymmetric equation (15.20) if and only if there exists a 
continuous strictly monotonic function f : I’ — Rand a real number a € 0, 1 
such that 


M(x, y) =f 'laf(x)+(—a)fQ)] forallx,y eT’. 


Result 15.4a. [623]. Let I’ be any (finite or infinite) real interval and {Mn}nen, be a 
decomposable sequence of continuous, symmetric, strictly increasing, and reflexive 
functions M, : (I1')" — R. Then and only then there exists a continuous strictly 
monotonic function f : I' + R such that, for alln € Z*, 


1 n 
(15.18) M,(x1,--.,Xn) = fo! [FX r00] for X1,...,%X, EI’. 


As already mentioned, the symmetry property is not essential in the previous 
result. 


Result 15.4b. [623]. Let I’ be any (finite or infinite) real interval and {M,} be 
a decomposable sequence of continuous, strictly increasing, and reflexive functions 
M, : (I’)" — R. Then and only then there exists a continuous strictly monotonic 
function f : I’ > R such that, for alln € Z*,, 
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1 n 
(15.18) My(x1,.--.Xn) =f! [FS s00] for x1,...,%, EI’. 


Definitions. A function M : R* — R is said to be a quasilinear function provided 
there exists a continuous and strictly monotonic function f : R — R and constants 
a, b, c such that 


(15.21) M(x,y)= f '@f@)+bf(y)+c), a,b 40, ¢,x,yeER. 


Whena+b=1andc=0, 


(15.22) Ma, y)=f '(l-a)f@)+afQ)), x yeER, 
is called a quasilinear mean. When a = 5; 
(15.18a) M(x, y) = f7! (ee). forx,y ER, 


is called a quasiarithmetic mean. 


Result 15.4c. (Hosszt [378]). The most general cancellative, continuous solutions of 
the right and left autodistributive laws (xy)-z = (xz)- (yz) and z- (xy) = (zx): (zy) 
are the quasilinear means (15.22). Let M : R* — R be increasing in its first 
variable, have partial derivatives of first order, and satisfy (right) autodistributivity. 
Then M is a quasilinear mean (15.22). 


15.2.1 Characterizations—Arithmetic and Exponential 
Means 


Theorem 15.5. [44]. The only solutions M : R* — R of the transitivity (15.12) and 
quasiarithmetic mean (15.18a) are 


x+y 


. 1 ex~+e 
A.M. = , exponential mean (E.M.) = — log 5 : 
c 


where c (& 0) is a constant. 
Proof. Using (15.18a) in (15.12), we get 


pot (ee Po) = (AO) 


forx,y,t ER. 
Let k;(x) = f(« +f) for a fixed t. Note that f is continuous and strictly 
monotonic. Then the equation above can be written as 


fo (ae) at (ee) ice 


576 15 Miscellaneous Equations 


Replace x by f~!(x) and y by f~!(y) to obtain 


CFV HIT O 2 [pt (242) seen (0-1 (42)) 


by the definition of k; that is, 


aay (ety) &FTDO)+ &F7O) 
(k; f (=) = nt ann: 


which is Jensen (J). So there exist an additive A : R — R (continuous since f is) 
and a constant b € R such that 


(ke f-')\Qx) = A(x) +b, forx ER, 


or 
f@tt) =k) = AGF) +b =ch@) +b =c@) fe) +b0, 


where c and b are functions of t, whichis a special case of (1.94a). By Theorem 1.50, 
we have either f(x) = ax + f or f(x) = ae’* + f, a, f,y € R. The first f 
in (15.18a) yields A.M. and the latter f in (15.18a) yields E.M. This proves the 
theorem. 


15.2.2 Geometric Mean and the Root Mean Power 


Theorem 15.6. (See [44], [250], and Chapter 17, Section 17.1.2). Let M:R—-R 
satisfy the homogeneity (15.13) and the quasiarithmetic mean (15.18a). Then the 
only solutions are either 


M(x, y)= (xy)? 


(geometric mean, G.M.) or 


xP + yP 5 
2 


M(x, y)= ( 
(the root mean power, R.M.P.), where p € R. 


Proof. The proof runs similar to that in Theorem 15.5. From (15.13) and (15.18a) 
there results 


A(f@+F/O) _ 1 (FEW + fou) 
mee (ee). 


Replace x by f~!(x) and y by f—!(y) to get 


fo (=). _ pf fF" eow + FEO) 
2 _ 2 ; 
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For fixed u, set f,(x) = f~'(«)u, so that we have 


5 (fa (22)) = Lew HHO, 


which is Jensen in ff, so that 
(ffu)(x) = A(x) +b 


and 
f(f7' (xu) = A(x) +b 
or 


f (xu) = A(f@)) +b = clu) f(x) + bw), 


where c and b are functions of u, which is a special case of (1.94b) (also (1.94a)). 
By Result 1.61, we obtain either 


f(x)=alogx +f or f(x)=ax?+f, a, B,peR. 


From the first form, we get the G.M. (x y)2 , and from the second form, we get the 


1 
cPLyP\ Dp . . 
root mean power (=) ” This completes the proof of this theorem. 


Remark. From the last two theorems, it follows that if 


1f{fO+fO\  -1fs@)+80) 
( 2 )=s ( 2 ) 


holds, where f, g : R — R are continuous, strictly monotonic functions, then there 
exist constants a, / such that 


f(x) =ag(x) +f forallx ER. 


Definition. Let M : I’ x I‘ > R be represented by 


_ p(x)x + pY)y 
ve MOY) = G+ pO) 


where p : J’ — IR is an arbitrary function and /’ is an interval in R. 
Equation (15.23) is a generalization of the quasiarithmetic mean (15.18a) and is 
called an arithmetic mean with weight function. 


Theorem 15.7. (Padles and Volkmann [661]). A function M : I’ x I' + Ris an 
arithmetic mean with weight function if and only if it has the Cauchy mean value 
property (15.15) and satisfies 


(x — M(x, z))(y — M(x, y))(@ — M(z, y)) 
(15.24) =—-(« —- M(x, y))(y — Mt, y))(z — M(x, z)) 
forallx,y,z€l'. 


forx,y El’, 
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Proof. “If part”: (15.23) obviously implies the mean value property (15.15) of M as 
well as (15.24) in the case where at least two of the x, y, z coincide. If x, y, z are 
three different elements from J’, (15.23) yields 

P(x) _ y=M,y) — p@) _ M@,2)-x ply) _ MG@ y)~z 

PQ) MG@,y)-x pe) z-MG,z)’ p@ y-M&,y)’ 


and (15.24) then follows from 


Px) PY), P@) _, 
p(y) pz) pi) 


“Only if part”: Now we suppose that (15.15) and (15.24) are fulfilled. Let x, y, z 
be three different elements of J’. Because of (15.15), we observe that all three differ- 
ences x — M(x, z), etc., occurring in (15.24) are different from zero. We interchange 
y and z in (15.24) and multiply the resulting equation by (15.24). After cancellation, 
we obtain 


(y- My, z)&-M&z, y) =(—y- Mt, y))(z — MV, 2)), 


which implies M(z, y) = M(y, z). So M is a symmetric function. With x = y = z, 
we obtain from (15.23) 


(15.25) M(x,x)=x, xel’. 
Now we fix an element z in /’ and we define p : I’ > R% by 


z—-M(x,z) 3 
(15.26) p(x) = 4 Meas (el a2); 
1 (x =z). 


(The positivity of the values of p is a consequence of (15.15).) Then it follows from 
(15.25), (15.26), and the symmetry of M that (15.23) holds in the cases x = y, x = 
z, and y = z. It remains to verify (15.23) for x # y, both being different from z. 
Therefore, (15.24), (15.26), and the symmetry of M imply 


y—M@,y) _ p(x) 


M(x,y)-x p(y)’ 


which gives (15.23) in this case. 


Remarks 


(i) In the proof of the theorem, we did not use that J’ is an interval. It is enough to 
assume that J’ has at least three different elements. 
(ii) The functional equation (15.24) has solutions that are not of the form (15.23); 


e.g., 


M(x,y)=x, M(x,y)=y, M(x, y) = max{x, y},M(@, y) = min{x, y}. 
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15.2.3 Stolarsky Mean 


Stolarksy introduced in [778] a mean 


1 


xP = yP)\ Fa 
os) , forx,ye Ri, p#qeR’, 


(15.27) Sp, y) = (= — yf) 


called the Stolarsky mean or Stolarsky’s difference mean, that lies between x and y. 
He showed that this mean can be extended continuously to {(p,g,x,y) : p,q € 
R, x, y < 0}. This extension is given below. 


1 
(yr if pq(p — q)(x — y) #0, 
(=)’ if p(x — y) #0, q =0, 
Spgltsy) = } (MBP) if g(x —y) £0, p=0, 
exp(—b + Bat) itgie—y) #0, =a, 


JXy ifx—y 40, p=q=0, 
Es ifx—y=0. 


814 


When g = 1, (15.27) takes the form 


ci 
| for x, y € R*. 
p@—-y) 


(15.27a) Sp(x,y) = 


For x, y > 0, Sp(x, y) is a continuous increasing function of p. 
The Stolarsky mean includes a large class of well-known means such as 


1 
P P\ > 
power mean Ap(x, y) i= (“) = Sopp, y), 
: : ne 
with Ao(x, y) = lim, A p(x, y) = (xy)? = G@, y) = Soo, y), 
p> 
arithmetic mean A(x, y):= — = $21(x,y), 
2xy 
harmonic mean H (x, y) := —— = S_2-1(%, y), 
x+y 
= 


logarithmic mean L(x, y) := a 
nx —Iny 


= So, y) = lim Sp, y), 
p-0 


= lim Spi, y) =: So,1(x, y) 
p—>0 


ee 1 (x*\> 
and identric mean i(x, y):= — {| — 
e \y’ 


= lim Spi@, y) =: S11, y) = Si, y). 
pl 
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Characterizations of Stolarsky Means 


We consider (Lukacs [616]) the functional mean of positive x, y, 


1 
My y)=f! (/ f= aya), 


where f : Ri > R, f” is continuous in R% and f is strictly monotonic in R. 


Lemma. For a twice continuously differentiable function f : R*, — Rand x,y > 
0, set 


1 
g(x, y) =, f(Ax + (1 —a)y)dd. 


Then i 
&xx(E,6) = sf ©) 


where ¢€ is an arbitrarily fixed positive number. 


Theorem 15.8. [616]. 


(a) Let p £ 0,1. Then Mf (x,y) = Sp(x, y) holds for all positive x, y if and only 
if f (t) = at?! + b, where a (4 0), b are arbitrary real constants. 

(b) My(x, y) = So(x, y) holds for all positive x, y if and only if f(t) = $+, 
where a (£0), b are arbitrary real constants. 

(c) M(x, y) = Si (x, y) holds for all positive x, y if and only if f(t) = alogt +b, 
where a (£0), bare arbitrary real constants. 


Proof. 


(a) We present the proof of (a) only [616]. Suppose M f(x, y) = Sp(x, y) holds for 
all positive x, y. Then 


1 


1 P _yP\ p=1 
-1 a Idi (= > ) 
i (/ fix +(1 yaa) — 


1 xP — yP pat 
| fx +(l—A)y)da = f (~~) = g(x,y) 
0 p(x—y) 


for all positive x, y. 

Differentiating both sides with respect to x twice, setting x = €, y = é, 
where ¢€ is an arbitrarily fixed positive number, in the resulting equality, and 
using L’ Hopital’s rule, we obtain gy ,(€, €). By the lemma, we obtain 


and so 


1 " a ” p-2 / 
3f (¢) = an ©) + Te f©), 
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and hence 
a. P=2 
f"O-=-f' ©) =0 
S 
This implies 
f(t) =at?-!+b 


in R4., where a, b are real constants with a 4 0. 


On the contrary, suppose f(t) = at?—! + b, where a (4 0), b are real constants 
and t = Ax + (1 — 2)y. Then 


pO = (—) ~ 
a 


and therefore 


1 
Mr(x,y)=f! GI (ax +(1 nolan] 


_f- ‘a Ray +5] 
Py) 
xP —yP 
~ —" 
= Sp, y), 


for all positive x, y. 


15.3 Some Comments about the 
Logarithmic Function 


1. Suppose f : R%. — R and satisfies 


1 
(i) fxy)— f@)=—-f (<) for x,y € R}. 


Now y = 1 yields f(1) = 0 and x = 1 yields 


f (~) ey. 
y 


so f satisfies the logarithmic equation (L). 
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Remark. Note that f satisfies (15.28). For 


xy 


by (i) 
= F(@+y ous 
=f(xty)— f(xy) 
i 1 
pot y)—s0)+ 4 (2) 
= f(x+y)— f(«*)— f(y), which is (15.28). 


)+fety 


2. Suppose f : Ri. — R and satisfies 
és 1 1 " 
(ii) fay) =-f - —f ; forx,y eR). 


Note that f (1) = 0. y = + gives f (+) = —f (x); that is, f is logarithmic (L). 
Remark. Note that f satisfies (15.28): 


r(+2) =r (4-5) 
x y xy 
by (ii) 1 A i 
eles) tl) 4) 


= f(xt+y)—-— f@)-— ff), which is (15.28). 


3. Suppose f : Ri — R satisfies the functional equation 


| oe | 
(15.28) fiety)- Fe) Foy =F (242) forx,y € Rj. 


The functional equation (15.28) is equivalent to the logarithmic equation (L) 
(Heuvers [367]). (This is (1.69); see Theorem 1.48.) 


Now we consider more functional equations related to (L). 


Summary. Let f : Ri. > R be a real-valued function on the set of positive reals. 
Then the functional equations 


(15.29) fos +y)- for) =F (242) forx,y eR, 


(15.28), and (L) are equivalent to each other. 
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If f, g,h : Ri. — R are real-valued functions on the set of positive reals, then 


(15.30) Ff +y)—gxy)=h (- + ~) 
x oy 


is the Pexider generalization of (15.29). We find the general solution to this Pexider 
equation. 
If f, g,h,k : Ri. > Rare real-valued functions on the set of positive reals, then 


i... 
(15.31) fa t+y)—g@)-hAQ) =k (- + ~) 


is the Pexider generalization of (15.28). We find the twice differentiable solution to 
this Pexider equation. 
There is another functional equation that is equivalent to (L). 


Theorem 15.9. (Heuvers and Kannappan [368]). The functional equation 


1 1 
(15.29) fis+y)-for)=F (242). forx,y > 0, 


where f : IR’. — R is equivalent to the logarithmic equation (L). 


Proof. First replace x and y by x~! and y~! in (15.29) to obtain 
1 1 1 
fe+y+f(—)=r(-+- 
xy x y 


1 1 
for) =-1(=) or £(7)=-r0. fort > 0. 


and 


Setx + y=sandt = 1 so that (15.29) becomes 


xy 


4 
(15.29a) f(s) + f(t) = fst) with = St 
KY 
Here, y = 4, sox + 4 = s or tx? — stx + 1 = 0. Thus 
_ st + J/s2t? — 4t 
7 2 , 
Let s,t > 0. Choose u > 0 such that — <u, + <u, and 5 <u. Now, 
: 4 
f(s(tu)) = f(s) + f(tu) since — < tu (by (15.29a)) 
Ss 


= f(s)+fO+FW since 5 < u (by (15.29a)). 
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But 


f(s(tu)) = f (stu) = f(st) + f(u) _ since = < u (by (15.29a)). 


Thus f(st) = f(s) + f(t) for all s,t > 0. Obviously, any logarithmic solution 
satisfies (15.29). This completes the proof. 


The Pexider generalization of (15.29) is the equation 


(15.30) faty)-gay)=h (-+-) for x,y > 0, 


where f, g, : Ri. — R are real-valued functions on the set of positive reals. 


Theorem 15.10. [368]. The general solution of (15.30) is given by 


(15.30a) f(x) =a+Lx), g(x) =b4+ L(x), h(xv) =a-b+L(x), xe Ri, 
where a and b are arbitrary constants and L is logarithmic on 0, oo[. 


Proof. First set 


(15.32) AM=fO, sO=sO-sQ), mO=sdt+a®, t>0. 


Then (15.30) transforms into 


‘a 
(15,33) fi@+y)-gi@y)=hi (=+ ~) for x, y > 0, 
where gi(1) = 0. Replace x, y by +, 2 in (15.33) to obtain 
1 ae 
(15.34) hi(x+y)+a1 (—) =F (=+-). 
xy x oy 


Add (15.33) and (15.34) to obtain 
1 1 
Pix + y)— Qilxy) = Pi (- “ ~) for x, y > 0, 


where 2Pi(x) = fi(x) + hi(x) and 201(x) = gi(x) — gi(¢) = —21(4) with 
Q,(1) = 0. So we have 


1 1.1 

(15:35) Pix+y)+Q1 (—) = Pj (- + ~) forx,y > 0. 
xy x y 

As before, set x + y = s and a = ¢ so that (15.35) transforms into 


(15.36) Pi (st) = Pi(s) + Qi(t) for at 
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Let s,t > 0. Choose u > iso that xu, set <u, and < u*. Then, 
Pi ((st)u) = Py (st) + Q1(u) since ay < u (by (15.36)). But 


P;((tu)s) = Py (tu) + O1(s) since a < s (by (15.36)) 


4 
= P\(t)+ Q1(u) + Q1(s) _ since a < u (by (15.36)). 
Hence P;(st) = P(t) + Q1(s) for all s,t > 0, a Pexider (logarithmic) equation. 
Thus there exists a logarithmic function L such that 


AM+MO _ 
2 


gilt) — g1 (+) 


(15.37) Pi (t) = a+L(t) fort>0, 


(15.37a) O\(t) = =L(t)  fort>0. 


Note that, putting y = + in (15.33) or (15.34), we get 
1 1 

n(x +2) =s («+2): 
x x 


(15.37b) A®=fO=AO=m@)=a+L@) fort >2. 


that is, 


Subtracting (15.34) from (15.33), we obtain 

(15.38) Po(x + y) — Qo(xy) = —P2 (- + ~) for x,y > 0, 

where 

(15.38a) Po(x) = fitx)—Ai@®) and Qo(x)=g1(%)+ 81 (-) = Qo (<) ‘ 


From (15.37b), it follows that P2(t) = O fort > 2. Set x + y = 2 in (15.38), and 
it follows that 


2 
P2(2) — O22 —x))=—- r(— =) 


for0 <x <2. But2 < a) and 0 < x(2—x) < 1 for0 < x < 2, so it follows 
that Q2(t) = 0 forO < t < 1. Now (15.38a) implies that Q2(t) = 0 for all t > 1 
and thus Qo(t) = 0 for all t > 0. Then it follows from (15.38) that 


1 1 
neena-a(ies) 
xy 


for x,y > 0. Here set x = y = 5, and we get Po(t) = —P,(4 ), which implies that 
Po(t) =O for0 <t < 2. Thus, P(t) = = 0Oforallt > 0. 
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Thus, 


fix) =hi(x) and g (-) =-—gi(x) forx > 0 (by (15.38a)). 


Now, by (15.37) and (15.37a), 


fi) = fx) =h(*x)=a+Lx), forx > 0, 


and 
gi(x)=L(x), forx > 0. 


In (15.32), if b = g(1), it follows that the solution of (15.30) is given by (15.30a). 
This proves the theorem. 


Remark. If P2, Q2 : Ri — R are real-valued functions defined on the positive 
reals with Q2(1) = 0, then the only solutions of 


1 1 
(15.38) Po(x + y) — Qo(xy) = —P2 (- + ~) 


are the trivial solutions P) and Q2 both equal to the zero function. 

By Theorem 15.10, there is a logarithmic function L and constants a and b € R 
such that Py =a+L, 02 =b+L, —P2 =a—b+L. Since Q2(1) =0= L(1), 
it follows that b = 0 and P>7 = a+ L = —-—a—L =O. Thus, L = —a = constant 
and therefore L = 0 anda = 0; i.e., Po = Q2 = 0. 


The Pexider generalization of 


1 1 
fir +y)~ Fo Fo) =F (> +>) 


is the equation 


1 1 
(15.31) Flr +y) = 0) WG) =k(2 +2), 
where f, g,4,k : ]O, co[ > R are real-valued functions on the set of positive reals. 
Theorem 15.11. [368]. The twice differentiable solution of (15.31) is given by 
f@) =—alogx +bx +c}, 


d 
g(x) = —alogx +bx — —4+¢, +03, 
(15.31a) x 


d 

h(x) = —alogx + bx — ——c2-03, 
x 

k(x) = —alogx +dx + c2, 


where a, b, ci, C2, €3, and d are arbitrary constants. 
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Proof. Differentiate (15.31) with respect to x and y to obtain 


Veh a 
rety=(5)# ee. 


Now sets = x+y andt = ++ 7 to obtain f”(s) = EK) or 


(15.39) f(s) =tk'(t) for st > 4. 


Set s = 4 in (15.39) to obtain t7k’(t*) = a = constant for t > 1 and set t = 4 in 
(15.39) to obtain s* f(s) =a for s > 1. 

For 0 <¢t < 1, chooses > 4 s0 that st > 4. Then, from (15.39) it follows that 
f and k satisfy the differential equations s* f’(s) = a and t7k’(t) = a fors,t > 0. 
After two integrations for each function, it follows that 


f(x) =—alogx+bx+c; and k(x) =-—alogx+dx+c2 
as in (15.31a). Putting these into (15.31), we obtain 
d od 
g(x) +h(y) = —alogx —alogy + bx + by — ie a — C2. 
Separate the x and y terms in the above to get 
d d 
g(x)+alogx —bx + =a =—|h(y)+alogy —by+ ri =C3, 

with c3 a constant. Thus 

d 

g(x) = —alogx +bx —--—+c¢1, +063 
% 


and 


d 
h(x) = —alogx +bx — — -— 2-03. 
x 


Thus the twice differentiable solution of (15.31) is given by (15.31a). This proves 
the theorem. 


Remark. Let L be an arbitrary logarithmic function; i.e., a general solution of (L). 
Let A and A? be arbitrary additive functions, general solutions of (A). Then, if 
C1, C2, and c3 are arbitrary constants, it follows that a solution of (15.31) is given by 


f(x) =L@)+ AQ) +e1, 


g(x) = L(x) + A(x) — Az ) +c. +03, 


)- eres, 


Ble ele 


A= Tj RAD =A ( 


k(x) = L(x) + A2(®) +2. 
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15.4 D’Alembert’s Equation Revisited 


D’ Alembert’s equation (C) is solved over all finite groups (see [213] for this entire 
section). We introduce the notion of a basic d’Alembert function, one for which 
f(xy) = f(x) for all x implies that y = 1. It is shown that every d’Alembert 
function factors through a basic d’ Alembert function. Then it is shown that the only 
finite groups that support a basic d’Alembert function are the cyclic groups (the 
classical case) and the binary groups: (2,m,n) = (R,S,T): R* =S™=T" = 
RST in Coxeter’s notation. Conversely, each of these groups supports a nonclassical 
d’ Alembert function. 

A function f : G > C (G a group) is a d’Alembert function if f satisfies (C) 
with f(e) = 1. The functions cos, cosh are d’Alembert functions (solutions of (C)) 
on the additive group R of real numbers. Indeed they are, up to automorphisms 
of R, the only continuous d’ Alembert functions on R (see Chapter 3). In studying 
d’Alembert functions, the idea is to make similar categorical assertions. When G 
is Abelian, Kannappan’s result [424] gives a definitive characterization: If f is a 
d’ Alembert function on G, then there is a homomorphism h : G — C* such that, 
for all x in G, (3.37) holds. It is easy to verify that if G is a general group and 
h : G > C* is a homomorphism, then the function h:Goc given by 


_ Ae) +h*(@) 
= a 


is a d’Alembert function of G. However, it is “really” a d’Alembert function on 


oy: the Abelianized version of G, since for all x, y, z in G, 


(15.40) h(x) 


A(xLy, 2]) = A(x), 


where [y,z] := yzy~!z7! is the commutator of y and z. To capture this sort of 
decomposition/factorization, the following definition is introduced: The d’ Alembert 
function f is basic if 


(15.41) f(xy) = f@)VxEG)> y=e. 


We see that, on R, cosh is basic, while cos is not. 

We show in Theorem 15.14 that every d’Alembert function is the composition 
of a group epimorphism followed by a basic d’Alembert function. So we need 
only characterize basic d’Alembert functions. Consequently, we say the group 
G is a d’Alembert group if there is a basic d’Alembert function with domain G. 
Thus R is a d’Alembert group (with cosh as the basic d’Alembert function) and 
S! = {z €C: |z| = 1} isad’Alembert group with cos : z > 5(z + Z) as the basic 
d’Alembert function; note that cos(x) = cds(e'*). Indeed it is easy to see (Proposi- 
tion 15.13) from Kannappan’s Theorem that the only Abelian d’ Alembert groups are 
those isomorphic to subgroups of C*. 

The main result is the characterization of the finite d’ Alembert groups as follows. 


Result 15.12. (Davison [213]). The finite group G is ad’Alembert group if, and only 
if, it is isomorphic to one of the groups below: 


di 
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(i) the cyclic group Zp of order n, 
(ii) the binary dihedral group (2,2, n) of order 4n, 
(iii) the binary tetrahedral group (2, 3,3) of order 24, 
(iv) the binary octahedral group (2,3, 4) of order 48, 
(v) the binary octahedral group (2,3, 5) of order 120. 


NS 


Here we use Coxeter’s notation (2, m,n) (see [178, Ch. 6.5]) to denote the group 
with generators R, S, T and relations 


R*=S" =T" = RST. 


That each of the groups in the theorem is, in fact, a d’Alembert group is proved. 
We show, using results of Wolf [833], that a finite solvable d’ Alembert group must 
be one of those listed, and using Suzuki’s result [781], we show that the only finite 
nonsolvable d’ Alembert group is the binary icosahedral group. 

After we prove Theorem 15.14, we provide some of its consequences, as well as 
pointing to possible future developments of this approach. 


15.4.1 Basic d’Alembert Functions 


In this section, we characterize the classical (given by equation (15.40)) d’ Alembert 
functions that are basic and deduce some consequences [213]. We also introduce a 
subgroup called the nub of F that yields the basic theorem. We then relate certain 
standard subgroups to well-known subgroups of G. Finally, we give a simple charac- 
terization of the nub of f when f is assumed to be a bounded function. Throughout 
this section, and later, we find Stetkaer’s work [772, 771] of inestimable importance. 


Proposition 15.13. Leth : G — C* be a homomorphism. The d’Alembert function 
(15.40) is basic if, and only if, ker(h) = (e). 


Proof. Referring to (15.41), hy) = h(x) if, and only if, 


A(x)fa(y) — 1 = AG')ACy|[hGQ) = 11. 


So h(xy) = A(x) for all x in G if, and only if, h(x) = 1; that is, if, and only if, 
y € ker(h). 


Using this, we can prove the following. 


Proposition 15.13a. [213]. Let G be an Abelian group and let f : G > C bea 
d’Alembert function on G. If f is basic, then G is isomorphic to a subgroup of C*. 
In particular, if f is basic and G is finite, then G is a cyclic group. 


Proof. By Kannappan’s Theorem [424], f = h for some h since G is Abelian. 
If f is basic, then ker(h) = (e) by Proposition 15.13, soh : G > C* is injec- 
tive (a monomorphism) and thus G is isomorphic to a subgroup of C*. Since finite 
subgroups of C* are cyclic, the final statement is proved. 


When G is non-Abelian, we need an analogue of ker(/). This is provided, as will 
be seen by the nub of /f. 
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Definition. Let f be a d’ Alembert function on the group G. The nub of f is 
N(f) = (9 €G: f(y) = f@) Vx € G}. 


We see, from this, that f is basic if and only if N (f) = (e). Reconsidering 
Proposition 15.13’s proof, we see that N(i) = ker(h). This leads naturally to our 
next result. 


Proposition 15.13b. [213]. N(f) is a normal subgroup of G. 


Proof. First we observe that e € N(f) since xe = x. Now suppose y,z € N(f). 
Then, from x(yz) = (xy)z, we deduce that yz € N(f). Next, if y € N(f), then 
f(x) = f(xe) = f(xy !y) = f(xy!) for all x € G. So y~! € N(f). Thus N(f) 
is a subgroup of G. 

Finally, to show that N(f) is a normal subgroup of G, suppose that y € N(f) 
and z € G. Since d’ Alembert functions are central [772, Lemma 5.1], we have for 
allx eG 


flazyz”') =f lxzy) = f(e-'xz) = f@). 
Thus zyz~! € N(f) and N(f) is normal. 


It is reasonable now to ask what happens when N(/f) is “factored out”. Here is 
the answer. 


Theorem 15.14. [213]. Let f be ad’Alembert function on G. Let 6: G > yy =: 
G be the canonical epimorphism of groups. Define f on G by the rule 


f(@(x)) := f(x) forallx €G. 
Then 7 is a function on G. Indeed 7 is a basic d’Alembert function. Thus 
f=fo¢ 
is a factorization of f into an epimorphism followed by a basic d’Alembert function. 


Proof. First we show that f is a function. So suppose that d(x) = #(y) for x, y 
in G; we have to show that f(x) = f(y). Now £(x) = ¢(y) if and only if x7! y € 
N(f). Hence f(x) = f(xx~!y) = f(y), as claimed. 


Next we show that f is a d’Alembert function. For all x, y in G, 
FGMOO) + FBO") = FOCY) + FG") 
= flay) + fay) 
= 2f@)f) 
= 2f(G@) FOO), 
and f(p(e)) = f(e)= 1. _ _ 
Finally, we show that f is basic. Suppose that f(¢(x)¢()) = f(P()) for all 


x in G. Then f(xy) = f(x) for all x in G, so y € N(f) and d(y) = d(e). Hence 
N(f) = (¢(e)) = (@) and f is basic. 
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We remark (see [772, Lemma 6.4]) that if / is a homomorphism from G to C%*, 
then h = h, where h : G/ker(h) > C*. 

For cosh, we see that (x) = exp(x) is the canonical epimorphism that is an 
epimorphism of R with the positive real numbers, and similarly, for cos we have 
(x) = exp(ix). We see further that cosh and cos are restrictions of the function 


ztz7! 
2 
of C* to C to subgroups. So this equation gives the “universal” Abelian d’ Alembert 


function in the sense that every d’Alembert function on an Abelian group is of the 
form c oh, where h is a homomorphism into C*. 


CLL 


We now turn to more general basic d’ Alembert functions and see that they have 
some pleasant properties. 

In analyzing Kannappan’s proof with a view to extending the result to more gen- 
eral groups, the following has proved useful [771]. 


Definition. Let f be a d’ Alembert function on G. Then 


Z(f): {ue G: f(xuy) = f(xyu), Vx,y € Gh. 


Our principal result is the following. 


Proposition 15.13c. [213]. Suppose f is a basic d’Alembert function on G. Then 
Z(f) = Z(G), the centre of G. 


Proof. Since xuy = xyu forall u € Z(G), itis clear that Z(G) is contained in Z(f). 
Assume now that u € Z(f). Then, for all x, y € G, 


f(xly,ul) = f@ryuy lw!) = foryy lulu) = f@), 
so [y,u] € N(f) = (e) for all y € G. Thus u € Z(G). This shows that Z(f) is 
contained in Z(G), and we are done. 
The next result will be used frequently when G is finite. 
Proposition 15.13d. [213]. Suppose f : G > C is a bounded d’Alembert function. 
Then 
N(fy={yeG, fy) =H. 


In particular, if f is basic, 
f= 1S y=e. 


If f is basic and H is a subgroup of G, then f\y is a basic d’Alembert function 
on H. 


That the basic d’ Alembert function on G is bounded is sufficient to ensure that 
every subgroup of G is a d’Alembert group. However, it is not necessary, for every 
subgroup of C* is a d’Alembert group with function z > as even though this 
function is not bounded on C*. 


592 15 Miscellaneous Equations 


15.4.2 D’Alembert Groups: Examples 


In this section, we give explicit examples of d’Alembert groups as groups of 2 x 2 
matrices, showing in particular that the binary dihedral groups are d’Alembert 
groups, as are the binary polyhedral groups (corollaries to Proposition 15.13e). 

Our results all depend on the fact that the rescaled trace function when restricted 
to matrices of determinant | is a d’Alembert function. (It is convenient, though 
nonstandard, to denote the identity matrix 


[0'] 


by 7.) This we show now. 


Lemma. Define c : SL2(C) > C by 


1 X+xXx! 
X):=-t ————_. 
c(X) 5 nee( 5 ) 


forall X € SL2(C). Then c is a d’Alembert function on SL2(C). 


Remark. We see from the definition of c that c(X~!) = c(X) for all X; also c 
is central since trace is. These are properties shared by all d’Alembert functions. 
We now use our function c and its restriction to produce d’ Alembert groups. 


Proposition 15.13e. [213]. The group SU2(C) given by 


{[F2] ace, P+ oP =1] 


and all of its subgroups are d’Alembert groups. 


Proof. Let G be a subgroup of SU2(C). Define f : G > C by f(X) = c(X) for all 
x € C. We claim that f is a basic d’ Alembert function on G. Since c is a d’ Alembert 
function, so is f. Now suppose 


isin N(f). Then f(Y) = 1, soa+a@ = 2. But |a| < 1, so we must have a = 1. Thus 
Y =i (identity). This shows that N(f) = (i), so f is basic and G is a d’ Alembert 
group. 


Remark. Since |c(X)| < 1 for all x € SU2(C), we see that c : SU2(C) > Cis an 
example of a bounded d’ Alembert function. 


Corollary (i). The cyclic group Zy of order n is a d’Alembert group. 


Proof. Let w € C be a primitive nth root of unity. Put 


e=([m)) 


Then G is a subgroup of SU2(C) and G ~ Zy. 
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Corollary (ii). The binary dihedral group (2,2, n) of order 4n is a d’Alembert group. 


Proof. Let w € C* be a primitive (2)th root of unity, so w” = —1, and put 


01 Ow w 0 
r=| $5): s=|_5 ap =| Oo 


Then 


Hence (R, S, T) is ahomomorphic image of (2,2, 7). But |(T)| = 2n and R ¢ (T), 
so (R, S,7T) has order at least 4n. Now (2,2,n) has order 4n [178, Ch. 6.5], so 
(R,S,T) & (2,2,n). Hence (2, 2,7) is a d’Alembert group. 

Corollary (iii). The binary polyhedral groups (2,3, n) forn = 3, 4,5 are d’Alembert 


groups. 


Proof. We choose generators in SU2(C) to satisfy the relations for (2, 3,7) as fol- 
lows. Choose t € C such that whenn = 3, t = 1; n = 4, t2 = 2: and 
n=5, t? =t+1.Putc =¢t +iVv3— 22. Then 


satisfy, as can be verified, the relations 


2 3 n : -1 0 
R=S°=T =rst=-i=(5 ahs 


So the subgroup (R, S, T) of SU2(C) satisfies the relations of (2, 3,1). Again it is 
straightforward to check that (R,S,7) has the correct order, namely 24 forn = 
3, 48 forn = 4 and 120 forn = 5. 


At this point, we remark that another realization of (2,3,5) is used by group 
theorists, specifically Suzuki [781]. Let F,, denote the finite field with p elements, 
where p is a prime. Then 


SLo(p) = SL2(Fp) := (|: | :a,b,c,d € Fp, ad —be = 7 


We have an isomorphism (2, 3,5) = SL2(5). Just take 


[to], s=[33) 7=[23] 


in SL2(5). Then 
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15.4.3 D’Alembert Groups: Generalities 


We will show later that a finite group is a d’Alembert group only if it has a very 
particular structure. In this section, we develop tools to delimit the structure of finite 
d’ Alembert groups. 

Our first result concerns the centre Z(G) of a d’ Alembert group (not necessarily 
finite). We know from Proposition 15.13c that Z(f) = Z(G) when f is basic. 
We will use this fact in quoting results from Stetkaer [772, 771]. 


Proposition. [213]. Let G be a non-Abelian d’Alembert group. Then 
Z(G) ={ueG:u* =e}, 


and Z(G) has at most two elements. 


1 


Proposition. Let G be a finite d’Alembert group. Then, for allx, y € G, if yxy~* € 


(x), then yxy! € {x, x7}. 
Proposition. Let G be a finite d’Alembert group. Then 


(i) every Abelian subgroup of G is cyclic, 
(ii) Z(G) = Gor Z(G) < 2, and 
(iii) if yxy"! = x*, then yxy} e igo), 


15.4.4 Solvable Finite d’Alembert Groups 


Wolf [833] has provided us with a convenient and easy-to-use formulation of Zassen- 
haus’s results characterizing the solvable finite groups that have every Abelian sub- 
group cyclic. He divides them into four types (I, II, III, IV), giving generators and 
relations as well as arithmetical conditions. We will use these types to determine 
necessary conditions for a finite solvable group to be a d’ Alembert group. 


Proposition 15.15. [213]. Let G be a Type I group. Thus G has order mn, G = 
(a,b) witha" =e, b" =e, bab—! = a", wherem > 1, n> 1, mandn(r — 1) 
are relatively prime, andr" =1 modm. 


G is ad’ Alembert group only if either m = 1 or m is odd, m > 3, n = 4, and 
r = —1. In the first case, G = Z,, and in the second, G = (2,2, m). 


Proposition 15.15a. [213]. Let G be a Type Il group. Thus G has order 2mn, G = 
(a, b, C) with (a, b) a Type I group as in Proposition 15.15, and the additional gen- 
erator C satisfying C? = b?, CaC~! =a’, CbC~! = DF, where n = 2"v with v 
odd and u > 2, as well as €2 = r*-! = 1 modm, k = —1 mod 2", andk? = 1 
mod n. 

G is ad’Alembert group only if it is isomorphic to (2,2, =), where n is divisible 
by 4. 


Proposition 15.15b. [213]. Let G be a Type IU group. Thus G is of order 8mn, G = 
(a, b, p,q), where (a, b) is of Type I, and the additional generators satisfy 
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pP=e pHc’=(&o.. ap= pa, 
aq=qa, bpb'=q, bqb'=pq, 


with n odd and divisible by 3. G is a d’Alembert group only if it is isomorphic to 
(2, 2,3), the binary tetrahedral group of order 24. 


Finally, we turn to Type IV groups. 


Proposition 15.15c. [213]. Let G be a Type IV group. Thus G is of order 
lomn, G = (a,b, p,g,C), where (a,b, p,q) is a Type Ill group, and the addi- 
tional generator C satisfies 


C= Pp. Cpe =P, Cac — a Cac"! =a‘, cbc"! = bk, 


with kX =1 modn, k =—1 mod 3, andr*—! = €2 =1 modm. 
Gisad’Alembert group only if it is isomorphic to (2,3, 4), the binary octahedral 
group of order 48. 


It must be noted that Wolf [833], at the end of the first section of his Chapter 6, 
mentions all of our groups—the cyclic, binary dihedral, binary tetrahedral, and bi- 
nary octahedral—as occurring in the Type I, II, UI, IV classification. In particular, 
he mentions that (2, 3, 3) is Type III with m = 1 andn = 3, and (2, 3, 4) is Type IV 
with m = 1, n=3,andk = —-1. 


Proposition. Let G be a finite solvable d’Alembert group. Then Z(G) = G or 
Z(G) = Zp. 


Corollary. [f |G| is not divisible by 4, then G is Abelian. 


15.4.5 Nonsolvable Finite d’Alembert Groups 


We show in this section that, up to isormorphism, there is only one nonsolvable finite 
d’ Alembert group—the binary icosahedral group (2, 3,5), or equivalently SL2(5). 

It is the matrix version of (2, 3,5) that is employed in this section because the 
group-theoretic result that we quote, Suzuki’s Theorem, uses it in a significant way 
to describe the action of an involute automorphism. 


Suzuki’s Theorem. [781, Thm. E]. Every Abelian subgroup of a nonsolvable group 
G of finite order is cyclic if, and only if, either 


(a) G=H x SL2(p), p > 3 is prime, H is ametacyclic group, and |H | and p are 
relatively prime or 

(b) G is generated by € = SL2(p), where p > 3, and three elements R, S, T, 
where R™ = S™ = T* =i, RSR7! =S", TRT“!=R, TST~! =S™', and 
RX = XR, SX = XS, TXT™! = 0(X) forall X € €, T? € Z(€), and the 
involute automorphism @ : € + € is given by 


0 ab|_ | d —-w'e 
cd| |-—wb a : 


596 15 Miscellaneous Equations 


where w € ie is a generator of ae Here m, n, r are integers with m and n 


relatively prime, mn is prime to p(p? — 1), andr™ =1 mod n. 


We recall that H (in (a) above) is metacyclic if, and only if, all Sylow subgroups 
are cyclic [178, Ch. 1, Sect. 9]. Note that a metacyclic group is solvable. Since 
SL2(p) is central to Suzuki’s Theorem, it is convenient to determine necessary con- 
ditions on the prime p such that SL2(p) is a d’Alembert group. 


Proposition. [213]. Let p be a prime. Then SL2(p) is a d’Alembert group only if 
p € (3,5). 


Proposition 15.15d. [213]. Let G be a nonsolvable finite group all of whose Abelian 
subgroups are cyclic. G is a d’Alembert group only if G is isomorphic to (2, 3, 5)— 
the binary icosahedral group. 


Proof of Theorem 15.14; Consequences 


We state Theorem 15.14 again in a slightly condensed form to remind us of what we 
have to prove. 


Theorem 15.14. [213]. The finite group G is a d’Alembert group if, and only if, it is 
cyclic, binary dihedral, or binary polyhedral. 


Proof. The sufficiency is proved in the three corollaries to Proposition 15.13e, where 
all groups mentioned are exhibited as subgroups of SU2(C). 

The necessity is proved in Propositions 15.15, 15.15a, 15.15b, and 15.15c for 
solvable groups and Proposition 15.15d for nonsolvable groups. 


One easy consequence of Theorem 15.14 is the following. 


Proposition. The finite subgroups of SL2(C) are precisely the finite d’Alembert 
groups. 


Stetkaer [772, Theorem 7.2] characterized the d’Alembert functions on step 2 
nilpotent groups. Theorem 15.14 shows that it is sufficient to characterize the basic 
d’ Alembert functions, so the critical result is (cf. [772, Lemma 8.4]) the following. 


Proposition 15.16. (Corovei [169]). If G is a step 2 nilpotent d’Alembert group, 
then G is isomorphic to the binary dihedral group of order 8, the quaternion group. 


Corollary. [169, Theorem 7.2]. If G is a step 2 nilpotent group and f is ad’Alembert 
function on G, then either f is classical or there is an epimorphism 9 : G > 
(2, 2,2) such that f = f o@, where f is the unique basic d’Alembert function on 
(2, 2, 2). 


Proof. As in Theorem 15.14, let g denote the canonical epimorphism from G to 
We =: G. Since G isa step 2 nilpotent group, G is either step | (Abelian) or step 
2 nilpotent. 

In the first case, Kannappan’s Theorem yields h : G — C* such that i hog. 


In the second case, h : G = (2,2, 2) by our proposition. 
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It seems likely that if G is a step n (n > 2) nilpotent d’Alembert group, then 
he G & Q,2,2°-, 4 generalized quaternion group, so nilpotency is a strong 
finiteness condition. 

Finally, it may be possible to generalize Kannappan’s Theorem as follows. Let f 
be a d’Alembert function on G. Then there is a homomorphism 
h : G — SL2(C) such that, for all x € G, 


h(x) ee 


f@ms= sce 5 


This truly extends Kannappan via the embedding of C* in SL2(C) by 


z 0 
LS Og! . 


We conclude by commenting that if g satisfies the “long” d’ Alembert functional 
equation, we can define N(g), the nub of g, and prove Z(g) = Z(G) when g is 
basic. However, the noncentrality of g so far poses a barrier to further results. 
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We studied the mean value theorem and some of its generalizations in Chapter 7. 
Here we consider more generalizations of the mean value theorem. 


Theorem 15.17. Let f,¢,h,2:R— R such that 


f@)- FO) _ 
(15.42) T= A) = (x) +h(y), forx,yeR, x Fy, 


holds with 2(x) 4 A(y) for x # y. Then 


fQ@) =A@)a +b — cA(x)) +4’, 
(15.42a) h(x) =a—cA(x), 
(x) =b—-ch(x), x ER, 


and 4(x) is arbitrary and a, b, c are real constants. 


Proof. It is easy to see that f — f(0) and 2 — A(0) is also a solution of (15.42). 
So, without loss of generality, assume that f (0) = 0 = (0). 
Setting y = 0 and x = 0 in (15.42), we have 


(15.43) f(x) =4a)(¢d(x) +4), (@=hA(O)) forx ER, x £0, 
(15.432) f) =A) AQ) +4), =$O) foryeR, y <0. 


Note that (15.43) and (15.43a) hold for x = 0, y = O and that 2(x) 4 0 forx 40 
and / is 1-1. 
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Substituting these values of f(x) and f(y) in (15.42) yields 
(15.44) 2(x)(-h(y) +a) = A(y)(—b + d(x))_ for x, y € R*. 
If h(x) =a, then d(x) = b (since 1(y) € 0 for y 4 0) and f(x) = A(x) + (a +) 
(use either (15.43) or (15.43a)), which is (15.42a) with c = 0. 

Otherwise, from (15.44) there follows 
(15.45) A(x) =ci(a—hA(x)) forx ER. 
Interchange x and y in (15.42) to get 

P(x) +h(y) = o(y) +h); 


that is, 


(15.45a) P(x) =c2+h(x) forx ER. 


Now (15.45), (15.45a), and (15.43) give (15.42a), the solution sought. 


Remark. It is easy to see that (15.42) has many solutions dependent on / and the 
solutions are obtained without assuming any regularity condition on f, 4, ¢, h. The 
mean value theorem is one among the solutions. 

The mean value theorem is obtained from Theorem 15.17 by taking A(x) = x. 
Then 


hina, éQ=otea— fay=*(QQatea-*), 
Cc Cl cl 
Even though 


2 
AQw)=x, A@)=a-~, 
Cl 


2 


2 
$@)=ata—~, and oy=e(m+0-2) 
C1 CI 


satisfy (15.42), it does not constitute a solution since A(x) = A(—x) forx #4 —x. 


We can create many more solutions of (15.42) by choosing A(x) suitably with 
(A(0) = 0), A(x) 4 AVC), forx ¢ y. For example, 


A(x) =e*-1, fx) =(* -1I@+b-ce* —- 1), 
h(x)=a-c(e’-1), d(x) =b-c(e’* —-1) 


constitutes a solution of (15.42). 
Another generalization is considered in the following. 
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Theorem 15.18. Suppose f, g,h,¢,4:R — R satisfy 


546 $780) _ paying) fors¢y, x yER, 
Ae) = Ay) 


with A(x) 4 A(y) forx # y. Then f, g, h, are given by 


f@) =AG)@+b+cd(x)) +a’ = g(x), 
(15.47) d(x) =cdA(x) +4, 
h(x) =cA(x) +b, forx ER, 


where 1(x) is arbitrary with 1(x) 4 A(y) for x # y anda, b, c, a’ are constants. 


Proof. Note that 2— (0) is also a solution of (15.46). So let us assume that 4(0) = 0. 
Also note that A(x) 4 0 for x 4 0 and J is one to one. 
Putting y = 0 and x = 0 separately in (15.46), we get 


(15.48) fx) =4a)(d(%)+a)4+c, forx 40, 
(15.48a) 8(y) =AQ)AQ) +4) +c. fory £0, 


where f(0) = ci, g(0) =c2, hO) =a, (0) =b. 
Substitution of (15.48) and (15.48a) in (15.46) gives 


(15.49) A(x)(P(X) +a) +01 -A(V) (AQ) +8) — €2 = AX) —- AD) (G@) +h), 
x,y #0, x #y. Interchange x and y in (15.49). 
(i) First add the resultant to (15.49) to have 
Max\(aly) — 4(0)) = A0)(a(x) — @)), 
where a(x) = ¢(x) + h(x), from which results (since A(x) # 0, x 4 0) 
(15.50) b(x) +(x) = a(x) = c3a(x) +a) forx £0 


(which also holds for x = 0). 
(ii) Subtracting the resultant from (15.49), we obtain 


A(x)(B(y) — BO)) + AW) BO) — BO)) + 2(c2 — ¢1) = 9, 
forx £0, y 40, x # y, where f(x) = d(x) —h(x), x € R, or 
(15.50a) P(x) —h(x) = B(x) = cad(x) +65, x #0 


(use A(y) #0, y #0). 
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From (15.50) and (15.50a), we have 
(15.50b) $(x) =DiAG)+b2, A(x) =b3A(x) + ba, x £0, 


where b; (i = | to 4) are constants. 
Utilizing (15.47) and (15.50b), (15.46) yields 


A(x)(a — ba) + A(y) (bz — b) = c2 — €1 + (3 — Di)A)A(Y), x,y #0. 
Fixing y (4 0) as yo, we get 
A(x)(a — by — A(y0)(b3 — b1)) = c2 — c1 — A(yo)(b2 — 5). 
Since / is not constant, we get 


a — bg = A(yo)(b3 — D1) 
= 4(91)(b3 — 1) (1 # Yo). 


Since / is one to one, this leads to b3 = by, a = bag and then cz = cj, b2 = b, 
giving the asserted solution (15.47) (cf. (15.42a)). 


Remark. As before, note that (15.46) has many solutions dependent on 2, and the 
mean value theorem is among them. 


Now we consider the functional equation 


(15.51) ID 80) ~ pia, x,yEeR x Fy, 


A(x) — Ay) 
where f, g,4,2:R— R with A(x) 4 1(y) for x ¥ y, and show that 
(15.5 1a) f@)=cAQx) + b= g(x), A arbitrary. 


Notice that 2 — 2(0) is also a solution of (15.51), so take A(0) = 0. Interchanging 
x and y in (15.51) results in 


f(x) — g(y) = g(x) — fQ): 


that is, f = g. Set y = 0 in (15.51) to have f(x) = b+ a(x), where b = f(0) = 
g(0) and h(O) = a. This proves this result. 


15.6 Inner Products Revisited 


First we prove the following theorem, which is used in Theorem 15.20. 


Theorem 15.19. The functions t; : R > RG = 1,2,...,n) (n = 3) satisfy the 
equation 
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n 
(15.52) DM AOi)=d, yi eR, 
i=l 


with >-"_, yi = 0, provided there exist an additive A and constants a; such that 


(15.52a) ti(x) = A(x) +a;, forx ER, 


with oF) a =d. 


Proof. Let yj) =x, yo =y, y3=—-x—y, ya =O=--- = yy to get 
(15.53) h(x) +hiy)+(—-x -—y)=c, forx,y ER, 


which is a Pexider equation. 
Interchange x and y in (15.53) to have 


(15.53a) to(x) — t1(x) = constant = c2, 
so that (15.53) becomes 
(15.53b) ti (x) + ti(y) + t3(—x — y) = 03. 


Now y = O yields 
ti(x) + B(—x) =c 


and 
h)t+no)=naty)+cea forx,yEeR. 


Thus there is an additive A : R — R such that 
ty(x) = A(x) + aq. 


From (15.53a), (15.53b), and (15.52), we get (15.52a) the solution sought. 
Remark. Compare this result with Theorem 1.76. 


In Chapter 5, we considered the functional equation (Result 5.7a) 
n n 
(5.25a) Doi —¥) =D) bi) — ny &) 
i=l i=l 
for x; in ann.l.s. K. Here we consider the most general equation of this type, 
n n 
(15.54) > i —%) = D> vias) + y@), 
i=1 i=1 


for x; € K, ann.Ls., 
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Theorem 15.20. (See Kannappan [479].) Functions ¢j,wi,w : K ~ RG = 
1,2,...,2) satisfy the functional equation (15.54) for n > 3 overa realn.l.s. K are 
given by 


di (x) = Bx, x) + (Ai + Aix) + bi, 


wi(x) = B(x, x)+ Aj(x) +c, 
(15.55) 


y(x) = —nB(x, x) — >) Aix) — )°Gi - ci), 


i=l i=1 


where Aj, A’ are additive, B is biadditive, and b;, c; are constants. 


Proof. We present a proof different from that in [479]. Putting xj = x = x2 =---= 
Xn in (15.54) yields 


(15.56) vax)= ici Di wi), xe K, 
is! i=1 


where ¢;(0) = cj. 
We have to treat the cases where n is even or n 1s odd. 


Case (i). n = 2k, n even. 
Set xy = +--+ = xp = 2K, Xp4y = +++ = X2K = 2y in (15.54) to get 
(15.57) 
(pi ++ + Pe) — y) + Geri +++ + bar)(y — x) = Bx) + O2y) + w+ y), 


where 


(15.57a) Bx) = (Vite + yer), Oe) = (Weg + + War) x). 


Case (ii). n = 2k + 1, n odd. 
Put 


Xp Srr Sxp =H 2X, Nee He HAM HZ2V, X21 = X+y, 
so that (15.54) becomes 


(1 +--+ + be)(x — y) + (Peer $+ + bx) (y — x) + G2n41(0) 
= B(2x)+ d2y) + waste +y)t+ywaty), 


which is the same as (15.57). So we solve (15.57) setting 
(15.576) 2g (x) = (Pi +--+ + Pe) (®) + Pepi +++ + G2x)(—x), x EK. 
Equation (15.57) can be written as 


(15.57c) 2q(x —y) = fQx)+dQ2y)+ warty), x, yek. 


15.6 Inner Products Revisited 
With x + y =u, x — y =p, this equation becomes 
(15.58) Blu+v) +d(u — v) = 2q(v) — wu). 
Put u = 0 in (15.58) to get 

B(v) + 6(—v) = 2q(v) — (0); 
that is, by (15.56) and (15.57a), 
Blu +0) + du — v) = Bo.) + d(-v) + yO) + B+ d)(U) — ae 
(15.59) 1 
= (B + 6)(u) + B(v) + 6(—v) — (B + 6)(0), 
u,veEeK. 

Change v to —v in (15.59) to have 
(15.59a) = B(u—v) + d(u+ 0) = (B + 6)(u) + B(—v) + d(v) — (B + 6)(0). 
First adding (15.59) and (15.59a) results in 


(B + d)(u + v) + (B+ d)(u — v) = 2(8 + d)(u) + (B + d)(0) 
+ (B + 0)(—v) — 2(B + 0)0), 


which is the same as (4.22) (see [230, 241], Remark 4.3). 
There exists an additive A and a symmetric biadditive B such that 


(15.60) (B+ 6)(x) = B(x,x) + A(x) +c forx eK. 
Second subtract (15.59a) from (15.59) to get 


(B — d)(u +0) —(B—d)u—0) 
(15.60a) = (£ — 6)(v) — (B — 6)(-v) 
= (B — 6)(—u + v) — (B — 6)(—u — 0); 


that is, 


(B — d)(u + v) + (B — 6)(—u — v) = (B — d)(u— 0) — B— 0)(—u +d) 


or 
(B — 6)(x) + (6 — 6)(—x) = constant forx € K. 


So (15.60a) becomes 


(B — o)(vo +u) + (B — d)(v — u) = 2(6 — 0)(0), 
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which is Jensen (J), so that 
(15.60b) (B —6)(x) = A'(x) +c" forx eK. 


From (15.60) and (15.60b) results (add) 


1 1 1 
(15.61) (yi +--+ wa) (x) = BQ) = 5B, x) + 5(A + A'\(x) + x +c”). 


This division of y;’s is arbitrary. Similarly, 


1 i! 
(wat + Wer@) = FBO, x) + 5(Al + AD) +0" 
(B is the same by considering y; + --- + wo, in both cases); that is, 
(15.62) Weei(x) = wilx) + Ag) + ep 44 forx € K 


(kK=1,2,...,2k —1). 
From (15.56) and (15.60), we obtain 


(15.63) w(x) = —B(x,x)- Aa) +a, 


with d some constant. 
Substituting y;.(x) from (15.62) into (15.57c) and using (15.63), we get 


(15.64) 
2q(x — y) = kw (2x) + (Az +--+» + Ag)(2x) +o +++ Hee thy (2y) 
+ (Anti t-++ + Aog)Qy) + cep t+ + c28 
—Baty,xt+ty)-A@t+y)td1. 


Let y = 0 and x = 0 separately in (15.64) to have 


(15.64a) 2q(x) = ky (2x) + (A2 +--+ + Ag)(2x) +2 +++ + 0K 
— B(x,x)-A(@) +d, 

(15.64b) = 2g(—y) = kwi2y) + (Angi +++ + Ark) (2y) + cep +++ + 2K 
— Bly, y)A(y) + 41. 


Hence we have 
2q(x — y) = 2q(x) + 2q(-y) — B(x, y)— BY, x) — a1. 
Replace y by —y in the equation above and add the resultant to obtain 
(4.22) q(x +y) + q(x — y) = 2q(x) + q(y) +q(-y) — a1. 
Hence, as before, we see that 


q(x) = B’(x, x) + A(x) +d) 
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for some additive A’ and biadditive B’. This q(x) in (15.64a) yields 
vir) = Bia, x) + AiG) +¢1 
and then 
(15.65) w(x) = Bix, x)+Aj(x)+cx, j=lton, 


where By, is biadditive and A ;’s are additive. 
Substituting of y; in (15.54) and using (15.56) yields 


w(x) = >) i) —nBi(x, x) 
1 


n n 
= > Ai) -doa, 

i=1 1 
nh 


>) ¢i@i — ¥) = D°LBi Gi, x1) + Ai(xi) + crn + >) GO) — n Bi, x) 
i=l 


a n 
— DIAG) — Di 
i=l i=l 


= DB Gi — ¥, x1 —X) + AiG —¥) + 61 )]; 


i=1 


that is, 
n n 
DleiGi -— 9) - BiG —¥,4; -F) - AiG - D1 = 4 
i=l i=l 
or 
n 
DVaod=d, yi=xi—*%, i=1ton, 
i=l 
where 


ti(x) = Pi(x) — Bi(x,x)— Ai), x EK. 
Then Theorem 15.19 shows that 
Pi(x) — Bix, x) — Aix) = ti(x) = AQ) +4, 
where A is additive or 
(15.66) Pi(x) = Bix, x) + (Ai + AVG) + ai. 


Thus (15.56), (15.65), and (15.66) give the asserted solution (15.55). This proves the 
theorem. 


16 


General Inequalities 


This chapter is devoted to functional inequalities. Functional inequalities occur in 
several fields, such as information theory, inner product spaces, geometry, complex 
analysis, trigonometry, and Cauchy, gamma, and beta equations. Classical A.M. 
> GM., logarithmic inequality, multiplicative inequality, trigonometric functional 
inequality, parallelogram identity, quadratic inequality, inequalities for the gamma 
and beta functions, Simpson’s inequality, inequalities from information theory and 
mixed theory, and reproducing scoring systems are treated. 


16.1 Cauchy Functional Inequalities 


First we start off with a familiar classical inequality, A.M. (arithmetic mean) > G.M. 
(geometric mean). 


Several proofs are known in the literature. The following simple proof is due to 
Polya [343] (see also Beckenbach and Bellman [108, p. 4-15]). 
Let az (k = 1 ton) € R,. Then 


and 


ak 


1 q x — 
In the inequality e* > 1+ x, setx = => 


—1(k=1 ton) to get 


k=1 
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that is, 
n 
ef > ik 
~ (A.M.)” 
which proves the inequality. 
The role of the exponential function in this argument could be played as well by 
any function f with the properties 


(16.1) fat+y)= f@)FO), 
(16.2) f(x) >=1l+x, 


where f is a real-valued function. 

It is shown in the following theorem that the only function f defined on an in- 
terval about 0 that satisfies both (16.1) and (16.2) on that interval is the function 
f(x) =e". 

Theorem 16.1. (Wetzel [828].) Let U be an open interval about 0 and f : U > R 
satisfy (16.1) and (16.2) whenever x, y and x + y belong to U. Then f (x) = e* for 
each x inU. 


Proof. By (16.1), f(x) = f((«/2)+(«/2)) = [f(«/2)° = 0 for each x in U. Thus, 
if f (xo) = 0, then f (xo/2) = 0, and by induction, f(2~”"x0) = 0 for each positive 
integer n. But (16.2) demands that f(x) > 0 near 0. It follows that f(x) > O for 
every x in U. 

Next we show that f must be differentiable at each point x of U and that f’(x) = 
f (x). Using (16.1) and (16.2), we see that, for small , 


f@+h)— fF) = FOMF@) — 1) = hf). 


Since f(x) = f(x+h—h) = f(xt+h)f(—A) = fe +h) —A), we have similarly, 
for small h, 


f@) 
l-h 


1 
Fes +m) ~ fy = poy] >t] =n 
Thus, for small, positive h, 


Ziern=7%) - LQ) 


Ft) h Sth 


and the reverse inequalities hold for small, negative h. Hence 


_ Sa +h) — f@) 
/ =] NE EEE ONES. 
f= i ; 
exists and equals f (x) for each x in U. 

It follows that 


d 
me £@) =e“[f'(x) — f@)] =0 
X 
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for every x € U, and so f(x) = Ce* for some constant C. Since (16.1) implies 
f() < 1 and (16.2) implies f(0) > 1, we have C = f(0) = 1, proving the 
theorem. 


We can generalize this result somewhat by relaxing condition (16.2). 


Result 16.2. [828]. Let f : U — R be such that (16.1) holds whenever x, y and 
x + y belong to U. If f is bounded below by a function g that is differentiable at 0 
and satisfies g(0) = 1, then f (x) = e forx € U, where k = f'(0). 


Corollary (i). [f f satisfies (16.1) on U, f (0) = 1, and f is differentiable at 0, then 
f(x) = e* forx eU. 


The assumption in the hypotheses of this corollary that f be differentiable at 0 
cannot be weakened to continuity, for the continuous function f(x) = (1 + |x|)7! 
satisfies (16.1) for all x and y and fails to be differentiable only at x = 0. 

The situation should be compared with the state of affairs for the functional equa- 
tion f(x + y) = f(x) f(). If f satisfies this equality for all real x and y, then the 
function g = log f satisfies the additive functional equation g(x ++ y) = g(x)+g(y), 
and if f is assumed to be continuous (even at just one point) or assumed to be 
bounded on some interval, or even if the graph of f is assumed not to be every- 
where dense in the upper half-plane, then g(x) = kx and f(x) = e* for every x 
(see [12] and Chapter 1). 

No result similar to that of Corollary (i) can be expected in the other extreme 
case f (0) = 0; any function f that vanishes for all nonpositive x, is monotonically 
increasing, and is bounded above by | satisfies (16.1). A more interesting example 
can be constructed as follows. Let h be any differentiable function that satisfies 
0 < A(x) < 0.9 and h(5) = A'(5) = h(7) = h'(7) = 0. Define 


0 forx <5, 

h(x for5 <x <7, 
Foye sis 

0) for7 <x <8, 


1 —exp{—(8 —x)*} forx > 8. 
Then f is a differentiable function with f(0) = 0 that satisfies (16.1) for all x and 
y. 
Corollary (ii). Let F be a function defined on an open interval U about 0 that 
satisfies 


(16.3) Fa+y)< F@)+FO) 


whenever x, y, and x + y belong to U. If F is bounded above by a function G that 
is differentiable at 0 and satisfies G(O) = 0 (and, in particular, if F is differentiable 
at 0 and satisfies F (0) = 0), then F(x) = kx for all x € U, where k is a constant. 


Proof. Apply Result 16.2 to f(x) = exp{—F (x)} with g(x) = exp{—G(x)}. 
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F satisfying the inequality (16.3) is called a subadditive function, and f satisfy- 
ing the inequality 


(16.3a) fEFWMS FOI) 


is called a superadditive function. 


16.2 Subadditive and Superadditive Functions 


Obviously the range of subadditive and superadditive functions are real numbers. 
We will consider them on various domains. 

Subadditive functions play an important role in the study of bounded additive 
transformations on linear spaces. They also play a basic role in the theory of semi- 
groups. A subadditive function need not be continuous (measurable) anywhere. Posi- 
tive homogeneous subadditive functions have important applications to the theory of 
convex sets [636] and to the uniqueness theory of differential equations (Hukachura 
[381]). 


Facts (Hille and Phillips [376]). 


(i) Let X be ann.l.s. and F : X — R satisfying (16.3) be nonnegative outside of 

a sphere. Then F(x) > 0 forall x. 
Proof. Suppose F(x) > 0 forall x, ||x|| => &. Then, for an x, ||x|| < k, there 
is a positive integer n with n||x|| > k. So, 0 < F(nx) < nF (x), implying 
F(x) > 0. 

(ii) F : X —> R satisfying (16.3) continuous at 0 with F (0) = 0 is continuous for 
allx € X. 
Proof. F(x) = F(x+h—h) < F(x+h)+F(—A). Evidently, F(x)— F(—h) < 
F(x +h) < F(x) + F(A) for all h. Continuity at 0 implies the continuity at 
all x. 

(iii) Let {F,}, a © index set, be a family of subadditive functions defined on R or 

R* or R*. Then F(t) = sup, Fy (ft) is also subadditive. 

(iv) Let {F,} be a pointwise convergent sequence of subadditive functions on R or 

R* or R*. Then F(t) = limy+o0 = (t) is also subadditive. 

(Rathore (eee): If g’(x) exists on Ja, co[ (a = 0), then g is subadditive if 

g'(x) < + and g is superadditive if g’(x) > a) Let F be subadditive on 

R and F'(x) exist on Ja, co[ (a > 0). If F(x) + F(—x) < 0 for x € Ja, oof, 
then F’(x) is monotonically nonincreasing on Ja, oo[. 

(vi) Let F be subadditive and be monotonic for sufficiently large positive values 


of x. Then Fis) — a finite limitas x — +00. If F(x) is monotonic for 
; de) 


~a 


(v 


sufficiently large negative x, then — a finite limit as x — —oo. 

(vii) Discontinuous solutions of (16.3) (subadditivity). Unlike the additive equation 
(A), (16.3) (subadditivity) has discontinuous solutions that can be constructed 
without the aid of the axiom of choice. As an example, define F : R > R by 
F(x)=m+kwhenm <x<m+4+1,meZ. 


16.2 Subadditive and Superadditive Functions 611 


Forn < y <n+1, Fat+y) =m4+n+korm+n+1+k, so that 

Foy) =F) Py) ie eS 1. 

This solution is a measurable discontinuous solution (cf. (A)). 

(a) If F : R > R satisfies (16.3), is odd, and is measurable, then F(x) = cx 
for some constant c, x € R. Replace x, y in (16.3) by —x and —y, and 
use F odd to get (A). 

(b) Every solution of (16.3) that is even is nowhere negative. x = 0 in (16.3) 
yields F (0) > 0. Use F even to have 2F (x) = F(x) + F(—x) => F(O) = 
0. 

(viii) (Rosenbaum [710]). Let RY. = {(x1,...,%n) : x > O}. 

F : RY — R defined by F(x) = (~ AKXk). , ak = O, is subadditive for 

r > Land superadditive for r > 1. 

F : R*. > Rdefined by F(t) = log(t + 1) is subadditive. 

F: Ry, > R given by F(t) = 3 + sint is subadditive. 

F : R" = R given by F(x) = (x")!/" is subadditive for r > 1 and superad- 


ditive for r < 1 (Minkowski’s inequality). 


Theorem 16.3. Let f : R — R satisfy superadditivity (16.3a) with f(1) = 1. Let 
g: Ry — R4 bea continuous, increasing function with g(0) = 0, g(._) = 1, such 
that 


(16.4) f(g(x)) = sf @)|) forx eR. 
Then f (x) =x forx €R. 


Proof. First claim that f(x) > 0 for x € R,. Choose x € R,. Then x = ny for 
some n € Z; and0 < y < 1. Moreover, y = g(t) for tf > 0 since g is increasing 
and continuous with g(0) = 0, g(1) = 1. From (16.3a) and (16.4), we get 


f(x) = fay) 2 nf (y) =nf(g@) 2 ng FO) = 0. 
It follows from (16.3a) that f is a monotone nondecreasing function with 
(16.5) f@)=0 and f(—x)<-—f(x) forx eR. 


Next we prove | f(x)| < |x| forx € R. Indeed g1) = 1 = fd) = f(g()) = 
f(gl|—1)) = gQf(—1))) and thus | f(—1)| = 1. But f(—1) < —1 by (16.5). Hence 
f(-D=-1. 

Use (16.3a) and (16.5) to get, forn € Z4, 


f(n) < —f(-n) = —f(a(-1)) < —nf(-1) = —n. 


But f(x) = n(f(1)) = a, so that f(n) = n, n € Z4. Let p,q € Z%}. Then 
p=f(p)=f (a2) >aqf (2) . Since f is a monotonic function, we have 


(16.6) f(x) <x forx eZ. 
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Choose x € R. Use (16.5) and (16.6) to obtain 
g(x) = f(g(xl)) = sdf @)) 


and 


(16.6a) If(x)| <|x], forx ER. 


Finally, claim f(x) = x. First let x = +2 for p,g € Zi. From (16.3a), (16.5), and 
f(a) =n forn € Z1, we have 


af (-2) ay (« (-2)) = f(—p) <-f(p) =p, 
q q 


which with (16.6a) yields 


Since f is monotonic, 
f@)=x forx <0. 


Let x > 0. There exists n € Zi such that x —n <0: 


f(x) =f@a@—-n+n)> fx—-n)+ f()=(-n)+n=x. 


Thus we obtain f(x) = x for all x € R. This completes the proof. 


16.3 Logarithmic Inequality 


Theorem 16.4. For functions f : G — X froma group G toa real or complex inner 
product space X, the inequality 


(16.7) IGM = IFC)+ FOI Gy €G) 
implies 
(L) fay=f@+f) @yEG). 


Proof. Let e denote the identity of the group G. With x = y = e in (16.7), we 
get f(e) = 0, and with y = x~! we then have f(x—!) = —f(x) forall x € G. 
Inequality (16.7) can be rewritten as 


(16.8) fe)? +2Re(f (x), f()) + IF OI? < If Gy)? forx, y € G. 


1 


Replacing x and y by xy and y “, respectively, in the inequality above, we get 


If @y)I? +2Re(f xy), fO7') + 11FO7 DIP S IF GIP: 
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that is, 


fy)? —2Re(f (ry), FO)) + IFOIIP? < IAGO. 
Adding the inequalities above and dividing by 2, we have 
(16.9) Re( f(x) + f(y) — fay), fQ)) < 0. 
Replacing x and y by x~! and xy, respectively, in (16.8), we obtain 
If@ DI? +2Re(f@"), F@y)) +I FYI? < IF ODI? 


that is, 


If GIP? — 2Re( F(x), fy) + IF Gy? < IFODI?. 
Also, combining this inequality with (16.8) yields 


(16.9a) Re( f(x) + f(y) — fy), f@)) < 0. 


Here replacing x and y by xy and y~! 


, respectively, we get 
(16.9b) Re( f(xy) — f) — f@), F@y)) < 0. 
Finally, adding (16.9), (16.9a), and (16.9b), we obtain 

If) + £0) — f@y)I? <0, 
which implies (L). The proof is complete. 


Remark. The same result is proved with the additional condition 
(K) f(xyz) = f(xzy) forx,y,2€G. 


Here the result is proved by reduction to the quadratic equation (Q). 
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It is pointed out that if f satisfies subadditivity || f(xy)|l < ||f@) + fO)II 
instead of the superadditivity (16.7) with (K), f (x7!) = — f(x), |[f(?)| = 21 f @)I. 
Then f satisfies (L). f(x) = log |x| satisfies all these conditions on R*. If G = X = 


Rand f(x) = 1, then f satisfies (16.3) but not (L). 


It is also clear that the theorem does not hold for arbitrary semigroups G. Con- 
sider the interval [0, +oo[ with the operation +. The function f : [0,+o00o[ ~ R 


given by 
f(x) =x? (x > 0) 


satisfies | f(x +y)| = |f@)+/()|, superadditivity, but we do not have f(x + y) = 


f(x) + fQ) forallx > 0, y= 0. 
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16.4 Multiplicative Inequality 
Result 16.5. Jf f is continuous and increasing in x > 0, is somewhere less than 


unity, and satisfies f (xy) < f(x) f(), then either 


(i) forx <6, y <0, where 6 #0, or 
(ii) for all values of x, y, 


we have respectively 


(i) f(x) < ax® forx <1or 
(ii) ax* > f(x) = f(1)x4/b for x < 1 and bx’ > f(x) => f(D* forx = 1, 
wherea,b>0,0<a<f. 


16.5 Convexity 


Definition. A subset C C R” is said to be convex provided, that for x, y € C, all 
points of the line segment joining x, y belong to C; that is, 


ax+(l-a)yCC for0<a<l. 


A real-valued function defined on an interval in R satisfying the inequality 
(Jensen) 


1 
(16.10) (=) < 51f@) + £0) 


is called a Jensen convex or J-convex function. The function is concave if the in- 
equality is reversed. J-convex functions need not be continuous. However, if a 
J-convex function is bounded above on a sufficiently large set, it must be continu- 
ous. 

A convex function partially monotonic at a point is continuous (Bereanu [119]). 
A measurable J-convex function is continuous (Hille and Phillips [376]) and satisfies 
the inequality (Hukachura [381]) 


(16.11) f(ax+(U-a)y)<af@)+Ud-a)fO), O<a<l. 


f satisfying (16.11) is known as a convex function and obviously satisfies (16.10), 
that is, itis J-convex. A convex function (16.10); that is discontinuous at an interior 
point of an interval is necessarily nonmeasurable. For finite measurable functions, 
(16.10) and (16.11) are equivalent. Equation (16.10) means geometrically that, for 
J-convex functions, the midpoint of the chord joining two points of the graph of the 
function lies “above” the graph of the midpoint of two points. 

The function f is concave if — f is convex. 


Result 16.6. [376]. Let f : Ri > R be such that (i) if $2 is decreasing, then 
f is subadditive; (ii) if f is J-convex and subadditive, then LE) js decreasing; and 


x 
(iii) a necessary and sufficient condition that a measurable, concave function f is 


subadditive is that f (0+) = 0. 
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Let @ be a positive number, and let ¢ be a real-valued function of a real variable. 
The function ¢ is said to be w-increasing in an interval U if and only if the inequality 


p(t +@) = P(t) 


holds for all ¢ such that t, ¢ + w € U; and ¢ is said to be w-decreasing in U if and 
only if the inequality 
ot +o) < o() 


holds for all f such that t, t-+@e€U. 

Let Q be a set of positive numbers. The function ¢ is said to be Q-monotonic in 
U if and only if it is either w-increasing in U for every w € Q or w-decreasing in U 
for every a € Q. 


Result 16.7. [555]. Let f : C — R be aconvex function in an open convex set C C 


R", n = 1, let xo be a point in C, let aj, i = 1,...,n, be n linearly independent 
n-dimensional vectors, let ¢;, i = 1,...,n, be positive numbers, and let Q| C 
(0, €;), i =1,...,n, be sets from the class 0. If, for eachi =1,...,n, the function 


Pilt) = fxo + taj) 


is Qj-monotonic in (—¢;, €;), then f is continuous in C. 


Lemma (i). (Haruki [348]). If f, g are entire functions of z and | f (z)| < m|g(z)| 
in |z| < +00, where m (= 0) is a real constant, then f(z) = cg(z) in |z| < +00, 
where c is a complex constant with |c| < m. 
Lemma (ii). [348]. Jf H is an entire function of z and g(t) = |H (te'?)|? (t, ¢ real, 
od fixed), then 

(0) = 2Re(e*? H" (0)H@)) + 2|H'(0)/*. 


From the Jensen equation (J), we obtain the inequality 


(16.12) 5 (FP) <A o 


where f is an entire function on C. 
Now we shall determine all the entire functions that satisfy (16.12) (see [348, 
346, 347]). 


Theorem 16.8. (Haruki [348, 346]). Jf f(x) is an entire function of x, then the 
functions that satisfy (16.12) are (ax+f)" and exp(ax+f/), where a, B are arbitrary 
complex constants and n is an arbitrary natural number, and only these. 


Proof. Since $42 < ,/f+”. where a, b are real, by 2| f(x)| < |f(@+y)I+1f@— 
y)|, which is obtained from (16.12), we have 


(16.13) AFG)? <If@+y I? +1F@ — yl. 
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Using a real parameter, t, p(t) = | f(y + te!)|? + |f(y — te™)|? has a minimum 
p(O) = 2\f (y)|? by (16.13). Here y is an arbitrary complex constant and @ is an 
arbitrary real constant. Hence we have 


p"(0) = 0. 
By this and Lemma (ii), we have 
Re(e™? f'")FO)) + IFW)? = O. 
Choosing 69 properly, we have 
Re(e™ Fy) FO) = IF" FOIL 

Hence we have 

FOF ON SIF OP. 
Hence we have the following inequality at every point x: 

IFO)F"@) <1F'@)P. 
By Lemma (i), we have in |x| < +00 

FRI") = cf), 


where c is a complex constant with |c| < 1. By solving this differential equation, the 
theorem is proved. 


Remark. If f(x) is ameromorphic function and satisfies (16.12) in |x| < +-oo, then 
we can easily prove that f(x) is an entire function of x. 


Applications. By the theorem above, we can solve the following functional equa- 
tions under the hypothesis that f(x), and g(x) are entire functions of x: 

(i) f (+2) — Loy fO) 

Gi) faty)=f@)+ fQ), 

Gi) f@+y) = f@SfO), 

(iv) fay) = f@)FO), 

(v) f@+y)f@—y) = f7@), 

(vi) If@ +y) +1F@ — yl = 2F@)1+ 2igsO), 
(vii) [f@ + y+ 1F@ — yl = 21F@) + 21F ODI. 


Solution of (i). f(z) = az + f. It is clear. 
Solution of (ii). f(z) = az. It is also clear. 
Solution of (iii). We have 


(2) -h@lv@l 


2 
If) +1FO)I 
——— 


16.6 Trigonometric Functional Inequality 617 


Hence f(z) satisfies the condition of the theorem. Hence we have f(z) = 0, or 
f(z) = exp(az), where a is an arbitrary complex constant. 
Solution of (iv). Putting g(z) = f(e*), by (iv) we have 


g(x +y) = g(x)g(y). 


By the result above, we have f(z) = O or f(z) = 1 or f(z) = z”, where n is an 
arbitrary natural number. 

Solution of (v). (v) has solutions 0, exp(ax + £), where a and # are arbitrary 
constants. 

Solution of (vi). By (vi) we have 


If + y+ IF @ — y= 2IF@)I, 


which implies 


I (=) “ ee 


Hence f(x) satisfies the condition of our theorem, and we can conclude that 
f(x) = (az+fy*, g(z) =e a2z?, where a, f are both arbitrary complex constants. 

Solution of (vii). By the result above, we have f(z) = az, where a is an 
arbitrary complex constant. 


16.6 Trigonometric Functional Inequality 


(a) The functional inequality 


(16.14) If@+y)l Ss IF@sO) +1f Os@)I, 
where f, g are entire functions (cf. (4.73)). 


Result 16.9. [348]. If f and g are entire functions with f(0) = 0, f’(0) 4 
0, g(0) = 1, and satisfy the inequality (16.14), then the solutions are given by 


f@) = ae sindz, g(z) =e” cosdz, a,540, 
and 
f@=aze®, gee”, a0. 


From the result above, the solutions of the following functional equations (A) 
and (E) can be deduced: 


(A) faty)=f@a)+fQ), 
(E) f@t+y=f@FO), 
(4.73) fat+y)= f)eQ) + fO)g@). 
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Solutions 
(A): ax, where a is an arbitrary complex constant. 
(E): 0, exp(ax), where a is an arbitrary complex constant. (Put F(x) = xf (x).) 


s es =0, 
g(x) arbitrary; 


(ii) f(x) = aexp(bx), where a, b are arbitrary complex 
ii 
g(x) = 5 exp(bx); constants 
(ii) f (x) = aexp(bx) sinex, where a, b, c are arbitrary complex 
iii 
g(x) = exp(bx) cos cx; constants 
: x) = ax exp(bx where a, b are arbitrary complex 
gy) [£00 =arexptbx), here a, b are arbitrary compl 
g(x) = exp(bx). constants 


(b) The functional inequality | f(x + y) f(x — y)| < |f@)? + If)? 
Considering Cauchy’s additive functional equation 
fat+y)=f@)+fO), 


where f (x) is an entire function of x, we have 


fa —y)=f@)- fo), 


and then 
faty)f—-y) = f?@)- £70). 


Hence we have 


(16.15) If +y)f@—-y)I <1f@)? +1F OP. 


Now we shall determine all the entire functions that satisfy (16.15). 


Theorem 16.10. [348]. [f f(x) is an entire function of x, then all the functions 
that satisfy (16.15) are exp(ax + £), asin Bx, ax, where a, fh are arbitrary 
complex constants, and only these. 


Proof. Case (i). f(0) 4 0. Putting y = x in (16.15), we have | f (0) f(2x)| < 
2| f (x)/? in |x| < 00. By Lemma (i), we have in |x| < +00 


f O)f (2x) = Cf*(x), 


where C is a complex constant with |C| < 2. Putting x = 0, by f(O) 4 0, we 
have C = |. Hence we have in |x| < +00 


(16.16) FOfeN= FO). 
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Since f(0) 4 0, choosing a positive constant r properly, we have f(x) 4 0 
in |x| < r. Hence there exists a regular function (x) in |x| < r such that 
f (x) = exp(¢(*)) in |x| < r. By (16.16), we have in |x| <r 


P(0) + P2x) = 26(x). 


Using the power series (x) = sae Gnx" in |x| < r and equating the coeffi- 


cients on both sides, we have 

2" an = 2a, (n=1,2,3,...). 
Hence we have a, = 0 (n = 2,3,4,...). Hence we have #(x) = ag + a x. 
Hence, if f(0) 4 0, the solution of (16.15) is exp(ax + £), where a, # are 


arbitrary complex constants, and only this. 
Case (ii). f (0) = 0. We may assume that f(x) 4 0. Using a real parameter, f, 


pt) = (FP + 1f ee)? — FO + te?) Fy — te’??? 


has a minimum p(0) = 0 by (16.15). Here y is an arbitrary complex constant 
that satisfies f(y) 4 0 and ¢ is an arbitrary real constant. Hence we have 


p" (0) = 0. 
By Lemma (ii), we have 
Re(e#(F)F"() — F°O NFO) $ IF OF OP. 
Choosing ¢o properly, we have 
Ree (FY) FO) — FRO NLA) = IF O)F"0) — POD? O)L. 

Hence, by f(y) 4 0, we have 

FOF’ O)- £7?) <1F OP. 
Hence we have the following inequality at every point x: 

IFO) F@) — £72) < FOP. 
By Liouville’s Theorem, we have in |x| < +oo 
(16.17) f@) fe) — PR @)=C, 


where C is a complex constant with |C| < | f’(0)|*. Differentiating both sides 
of (16.17), we have in |x| < +00 


(16.18) FQ)" @=f @)si' eH. 
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(c) 
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Since f(x) ¥ 0, choosing a neighbourhood V properly, we have f(x) 4 Oin V. 
Hence, by (16.18) we have in V 
" ! 
e ) - 
f(x) 


f"@) _, 
F() 


Hence we have 


where k is a complex constant. 
Solving this differential equation, if f (0) = 0, then the solutions of (16.15) are 
asin fx, ax, where a, # are arbitrary complex constants, and only these. This 
proves the result. 


Applications. By the theorem above, we can solve the following functional 
equations under the hypothesis that f is an entire function of x: 

G) fet+y)= f@)+fO), 

Gi) fx +y)= f@)fO), 

Git) fa +y) fo —y) = £7), 

dv) [f@ + y)P? +1F@ — vy)? = 21fF@P + 2F OD, 

W) f@+yf@-y) = f°) - £76), 

(vi) fa@+y)f@-—y) =(F@)- FO), 
(vit) fe +y)+ f@—y) =2fQ) FO). 


Solution of (iv). Since 2ab < a? + b?, where a, b are real, we have 


Afae+tyfa-yi<If@+yP+1f@—- yD. 


Hence, by (iv) we have 


If +y)f@ —y) <1F@O)P +1F ODP. 


Hence f(x) satisfies the condition of our result. Hence we have f(x) = ax, 
where a is an arbitrary complex constant and the solution of (iv) is only this. 
((v) asin Bx, ax, where a, f are arbitrary complex constants. (vi) a sin? px, 
ax”, where a, f are arbitrary complex constants. (vii) 0, cos ax, where @ is an 
arbitrary complex constant.) 

The functional inequality 2| f(x) f(v)| < IFO F@ +») +1F@ — yD 


Considering the functional equation 


FOF +y) + f&—y) =2fOFO), 


where f (x) is an entire function of x, we have the following functional equation: 


(16.19) AF@MFO! SIFOMFE@ + WI +IF@ — y)D- 


Now we shall determine all the entire functions that satisfy (16.19). 


(d 


(e 


Ww 


wm 
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Result 16.11. [348]. If f(x) is an entire function of x, then the functions that 
satisfy (16.19) are acos” Bx, aexp(Bx7), where a, PB are arbitrary complex 
constants and n is an arbitrary natural number, and only these. 


Applications. By using Result 16.11, the solutions of (A), (E), (Q), (C), and 
f(x+y)f@ — y) = f?(x) f?(y) can be deduced (see [348, p. 14]). 

The functional inequality | f*(x) f(y + Jf(y -—2)1 < IPO) F@ + DFO - 
I+ If? @F&+y)f& — y)I 


Considering the functional equation concerning the sigma function in the theory 
of elliptic functions 


PROFOtDFO-DtLPO)FE+HNFIE-—HD+ FOF A+) Fe—y) =0, 


where f is an entire function of x, under the consideration that f is an odd 
function of x that follows from the equation above, we have 


IPOFO+DfO-—DI<ILPOFA@+DFO-DI 
HIP? @)F + y) fF —-y)I.- 


Now we shall determine all the entire functions that satisfy (16.20). 


Result 16.12. [348]. [f f(x) is an entire function of x, then the functions that 
satisfy (16.20) are the following and only these: 
G) f(x) = exp(ajx + a2x + a3), where a1, a2, 43 are arbitrary complex 
constants, 
(ii) f(x) = exp(aix? + a)o (2), 
f (x) = x exp(aix? + a2), 
f (x) = exp(a1x? + a2) sin(a3x), 
where 1, 2, a3 are arbitrary complex constants. 


(16.20) 


Applications. By using Result 16.12, the functional equations (A), (E), (Q), (S), 
fe+ynfe-y =/@P’O), f@+YF@-—y) = o@)¥O), f@+ 
yf —y) = f2@), Fe +y) + F& — y) = 26(x) + 2) can be solved 
(see [348, p. 18]). 

The functional inequality 2| f(x)g(y)| < |f@ + y)| + 1f@ — y)| 


If f, g are entire functions with g(0) = | and satisfy the functional inequality 


(16.21) Af@sOl Ss IFO + WI +1F@— YI, 


then f(x) = acos"(bx +c) or f(x) = exp(ax? + bx +c) or f(x) = (ax + 
b)", where a, b, c are arbitrary complex constants and n is an arbitrary natural 
number. The solutions of (16.21) are only these. 

Proof. Putting h(y) = 4(g(y) + g(—y)), we have 


2| f(x)h(y)| < ea Ae eel 
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From (16.21), we have 


2\f@)g(—y)l Ss If@ — y+ IF @ + y)I- 


Thus we have 


(16.22) AFA Ss IF@+WI+IF@ —y)I, 


where A(0) = 1. 
Since ooh <,/ ae where a, b are real, by (16.22) we have 


(16.23) AFA)? <If@+yP +1f@— yD. 


Since (0) = 1, there exists a positive number 6 such that h(y) 4 Oin |y| < 6. 
Let us assume that y is an arbitrary complex constant and put y = te’?, where 
t, d are real with |t| < 0. 

Putting x = y, y = fe’? in (16.23), we have in |t| < 6 


f(y +tel)| | f(y — tel?) /? 


2 
2FQ) s h(te'?) h(te'?) 


From this inequality, we have the following differential equation in the same 
way as in Result 16.11 (considering that /(y) is an even function of y and that 
h(O) = 1, h'(0) = 0): 


f(x) fe) — AO) f7 (x) = CF? (x), 


where C is a complex constant with |C| < 1. Solving this differential equation, 
the theorem is proved. 


Applications. By the theorem above, we can solve the following functional 
equations and functional inequalities under the hypothesis that f, p, qg are entire 
functions. 

G@ f@+y)+ f@—y) =2p@)q0), 

Gi) f@ +y)— fF —y) = 2p@)qO), 

Gil) f@+y)f@—y) = #7) — f°), 

Gv) f@+f@-y) = /O)+/O)-1, 

(v) f@ + y)f@ — y) = p@)aqy), 

(vi) Ff +y) + f@ — y) = 2p) +240), 

(vii) | f (=>)| < Lore rot ' 
(viii) ASSO <IFOMF@ +14 1F@ — yD, 

(ix) |f(« + iy)|* = d(x) + wy), where x, y are real and A(x), y(y) are 
real-valued functions, 
(x) [f@ + iy)? = If @)F + FG)? (, y real), 
(xi) [fF + iy)? = IF @)? + If)? — LG, y real). 
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Solution of (i). We may assume that g(0) = 1. Putting y = 0 in (i), we have 
p(x) = f(x). Hence, by (16.21), we have 


fa@ty)+ f@—y) =2f@)q0). 


Hence we have 


ZF @aOl s IF@+y1+|F@ — yD, 


where q(0) = 1. 

By the theorem above, we have f(x) = asinax+bcosax or f(x) = ax+b, 
where a, b, c are arbitrary complex constants. The solutions of (i) are only 
these. 

Solution of (ii). Differentiating both sides of (11) with respect to y, we have 


f(x ty)+ f'@ — y) =2p(x)q'(). 


Hence, by the result of (i) above, we have f(x) = asinax + bcosax +c or 
f@me= ax* + bx +c, wherea, b, c, aare arbitrary complex constants. 
Solution of (iii). Putting x + y, x — y inplace of x, y, respectively, in (ili), we 
have 


Paty)- fa —-y) = fQx)f Qy). 


Hence, by the result of (ii) above, we have f(x) = Csinax or f(x) = Cx, 
where C, a are arbitrary complex constants. The solutions of (iii) are only 
these. 

Solution of (iv). Putting x + y and x — y in place of x and y, respectively, in 
(iv), we have 


Ft y+ fP@—y)—1= FRx) fy). 
Putting F(x) = f7(x), by (16.21) we have 
F(x t+ty)+F@—y)—-1= fx)fy). 
Differentiating both sides of the equation above with respect to y, we have 


F’(x + y)— F'Q@ — y) = 2f 2x) f'Qy). 


Hence, by the result of (ii) above, we have f(x) = -+Ecosax, where a is an 
arbitrary complex constant. The solutions of (iv) are only these. 

Solution of (v). We may assume that g(0) = 1 and g(y) 4 O in |y| < +o0. 
Putting y = 0 in (v), we have p(x) = f(x)*. Hence, by (v), we have 


VIF@ + WFO -WI=IFf@)vVIO), 
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where ./q(y) stands for an entire function of y that satisfies /g(y)? = q(y) in 
|y| < oo. Since 


2IFf@+wFa-—yI<lf@ +yI+lF@ — YI, 


we have 


2F@VaIO) S IFO + I+ IFO — y)I, 


where ./q(0) = 1. 

Finally, we have f(x) = 0 or f(x) = exp(ax? + bx +c), where a, b, c are 
arbitrary complex constants. The solutions of (v) are only these. 

Solution of (vi). Putting F(x) = exp(f(x)), d@) = exp(p(*)), wr) = 
exp(q(x)), by (vi) we have 


FatyFa-y=¢')y*(). 


By the result of (v) above, we have f(x) = ax* + bx +c, where a, b, c are 
arbitrary complex constants. The solution of (vi) is only this. 
Solution of (ix). By (ix) we have 


Ife+yP+1f@-—y)P =f +H? +1FG — HI, 


where x, y are complex and 


f@+y)+ f@—y) =2p(«)qQ), 


where p(x), g(y) are entire functions. 

Hence, by the result of (1) above, we have f(x) = asinax + bcosax or 
f(x) = asinhax + bcoshax or f(x) = ax + b, where a, b are arbitrary 
complex constants and a is an arbitrary real constant. The solutions of (ix) are 
only these. 

Solution of (x). By the result of (ix) above, we have f(x) = C sinax or f(x) = 
C sinhax or f(x) = Cx, where C is an arbitrary complex constant and @ is an 
arbitrary real constant. The solutions of (x) are only these. 

Solution of (xi). By the result of (ix) above, we have f(x) = e” cosax or 
f(x) = e! coshax, where 0, are arbitrary real constants. The solutions of 
(xi) are only these. 


16.7 Cosine and Sine Functional Inequalities 


Result 16.13. (Badora and Ger [83]). 
(i) Let G be an Abelian group and let f : G + Cand ¢: G => R satisfy the 
inequality 


(16.24) |fa@+y)+f@—-—y)-2f@)fMI SO), forx,y €G, 
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or 


(16.24a))  |f@+y)+f@—y)—-2f@)fO)S¢@), forx,y EG. 


Then f is either bounded or satisfies (C), the cosine equation. 

(ii) Let G be an Abelian group, H a semisimple commutative Banach algebra, and 
f:G— Hand¢:G = R satisfy (16.24) or (16.24a). Then f is a solution of 
(C) provided that, for an arbitrary linear multiplicative functional x* € B*, x*o 
f fails to be bounded. 


Remark. It would be desirable to get rid of the somewhat awkward hypothesis that 
all the superpositions x* o f are unbounded, assuming simply that f itself fails to be 
bounded. However, as shown by the counterexample below, the assumption that f 
yields an unbounded solution of inequality (16.24) (resp. (16.24a)) is insufficient to 
force f to satisfy equation (C). In fact, consider a Banach algebra H of all diagonal 
2 x 2 matrices with complex entries and a function f : R — A given by the 
formula 
cos ax 0 


rey =| | (x ER), 


where a € C\R and b € C\{0, 1} are arbitrarily fixed. Then f yields an unbounded 
solution to both the inequalities (16.24) and (16.24a) with an arbitrary majorizing 
function ¢ : R — R enjoying the property inf{f(t) : t € R} > 2|b|- |1 — DI. 
Nevertheless, f fails to satisfy d’ Alembert’s equation (C). 

What is missing? The superposition of f with the linear multiplicative functional 


x* : H — C given by 
*([or)) =" (Lo}e") 


is constant and hence bounded. 
Corollary. A function f : R — C is of the form 

f(x) =cosax (x €R), 
with some constant a € C\R, if and only if 


(i) f is Lebesgue measurable; 
Gi) f is unbounded; and 
(iii) there exists a function ¢ : R — R such that 


If@+y)+ f@—y)—-2f@) FOS ¢0), forallx,y ER, 


or 


If@+y)+ f@—y)—2f@)FO)| SO), forallx,y ER. 
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Result 16.14. [83]. 


(i) Let G be a uniquely 2-divisible Abelian group and let f :G — Cand¢:G—> 
R satisfy the inequality 


x+y x= y 2 
(16.25) SOTO =F (=) a r( 5 ) < $() 
or 

x+y x-y : 
(16.25) rong (FE) +4 (2) ] 009 


for x,y € G. Then either f is bounded or satisfies (S), the sine equation. 

(ii) Let G be an Abelian group, H be a semisimple commutative Banach algebra, 
and f :G > H and¢: G => R satisfy either (16.25) or (16.25a). Then f 
satisfies (S), provided that, for an arbitrary linear functional x* € H*, x* o f 
fails to be bounded. 


16.8 Functional Equation Concerning the Parallelogram 
Identity— Quadratic Inequality 


Considering Cauchy’s functional equation (A), where f is an entire function, we 
have the following functional inequality: 


(16.26) If@ + yI+IF@ — YI S2IF@)1+21F ODI. 
All the entire functions that satisfy (16.26) are determined in the following theorem. 


Theorem 16.15. [348]. [f f is an entire function, then all the functions that satisfy 
(16.26) are a, ax, ax”, where a is an arbitrary complex constant, and only these. 


Proof. Putting y = x in (16.26), we have in |x| < +00 


[f(2x)| < l4f@)I. 


By Lemma (i), we have in |x| < +00 


f (2x) = Cf (x), 


where C is a complex constant with |C| < 4. 


Using the power series 
+00 


f(x) = ae 


n=0 


in the equation above and equating the coefficients of both sides, we have 


an = Ca, (n=, 1,2,...). 
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(i) If C = 1, then we have a, = 0 (n = 1,2,3,...). 
(ii) If C = 2, then we have a, = 0 (n = 0, 2,3,... 
(iii) If C = 4, then we have a, = 0 (n = 0, 1,3,... 

(iv) Os 1.2.4, henweluved, =0G=0.1,2,:. .). 


-~LY 


We now consider the functional inequality 


(16.27) If@ +P +1F@ — WP = FG)? +21F OP, 


where f is an entire function. 
Inequality (16.27) is connected with the parallelogram inequality (parallelogram 
identity when p = 1) 


[x + y|?P + |x — y|?P > 2x7? + [2]y1??, 


where x, y are complex variables and p is an arbitrary natural number. 
Inequality (16.27) implies the functional inequality 


(16.28) F@+y)P +1F@—y)P 2 2F@Y — FO): 
where f is an entire function. 


Result 16.16. (Haruki [351]) FF is an entire function of z and satisfies (16.28), 
then and only then f(z) = az” or f(z) = asinaz, where n is 0 or an arbitrary 
natural number and a, « are arbitrary complex constants. 


Corollary. The entire solutions of (16.27) are obtained from that of (16.28). The 
only entire solution of (16.27) is ax", where n € Z4 and a is an arbitrary complex 
constant. 


16.9 Inequalities for the Gamma and Beta 
Functions via some Classical Results 


16.9.1 Inequalities via Chebychev’s Inequality 


The following result is well known in the literature as Chebychev’s integral inequal- 
ity for synchronous (asynchronous) mappings. 


Let f,g,h: I CR — R be such that h(x) => 0 forx € I andh, hfg, hf and 
hg are integrable on I. If f, g are synchronous (asynchronous) on I (i.e., we recall 


(f(x) — FONG) — 80) 2 (S) 0 forallx,y et), 


then we have the inequality 


(16.29) [neoax [noofeecoras > (2) [meoyrenax | nerecoras. 
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A simple proof of this result can be obtained using Korkine’s identity 


[ncoas | h(x) f &)g(x)dx — [rcoresas | nergeoas 
(16.30) */ - : : 
=5 | [ mcomonre) - fone) eoyasay. 
The following result holds. 
Theorem 16.17. [228]. Letm, n, p, q be positive numbers with the property that 


(p —m)(q —n) < (=) 0. 


Then 

(16.31) B(p, q)B(m, n) = (S) B(p, n)B(m, g) 
and 

(16.3 1a) T(p +n)P(q +m) = (s)T(pt+q)Pm +n). 


Proof. Define the mappings f, g, : [0, 1] — [0, co) by 

f(x) =xP-™, g(x) = (1— x)", and A(x) = x" 1 — xy". 
Then 

f'@) =(@—m)xP "|" '@)=@-ga-xt" "|, xe 0,1). 


As, by (16.30), (p — m)(q —n) < (©) 0, then the mappings f and g are synchro- 
nous (asynchronous), having the same (opposite) monotonicity on [0, 1]. Also, A is 
nonnegative on [0, 1]. 

Writing Chebychev’s inequality for the selection of f, g, and h above, we get 


1 1 
i Fiat _ yt lax f x™ la =x) "sh" _ x)t"dx 
0 0 
1 1 

> © | x™ la — ay tae mas | ie a _ xy ta =x)0 "dx. 

0 0 
That is, 
1 1 
| ee ie ot tas f xP! — x)t!dx 
0 0 
1 1 
> © / sl — ay ldx f a= ait de, 
0 0 


which, by (14.36), is equivalent to (16.31). 
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Now, using (16.31) and (14.34), we can state 


PpP)r@ © Tm) (n) sj PPT) I'(m)T(q) 
T(p+q) T(m+n) — T(p+n) T(m+q)’ 


which is (16.31a). This proves the result. 


The following corollary of Theorem 16.17 may be noted as well. 


Corollary. For any p,m > 0, we have the inequalities 


(16.32) Bm, p) = [B(p, p)B(n, m)}? 
and 
(16.32a) T(p +m) > (Tp)TQm)]2 


Proof. In Theorem 16.17, set g = p andn =m. Then 
(p —m)(q —n) = (p—m)’ > 0 


and thus 
B(p, p)B(m,m) < B(p,m) Bm, p) = B*(p,m), 
and the inequality (16.32) is proved. The inequality (16.32a) follows from (16.32). 


The following result employing Chebychev’s inequality on an infinite interval 
holds. 


Theorem 16.18. [228]. Let m, p, and k be real numbers with m, p > 0 and p > 
k > —m. If 


(16.33) k(p —m —k) > (s)0, 


then we have 


(16.34) P(p)V(m) = (Ss) P(p — k)P(m + k) 
and 

(16.34a) B(p,m) = (S) B(p—k,m +k), 
respectively. 


Proof. Consider the mappings f, g,/ : [0, co) — [0, oo) defined by 
f(x) = gh, g(x) = — xk , h(x) = =" 1, ax 


If the condition (16.33) holds, then we can assert that the mappings f and g are syn- 
chronous (asynchronous) on (0, oo) and then, by Chebychev’s inequality on [0, oo), 
we can state 
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lore) fore) 
| sta) Pk yk ym—1 x gy 
0 0 


love) ee) 
> ‘© / ee a ad xk le dx: 
0 0 


Co CO 
| wmtesdx | xPle-*dx 
0 0 


CO [o,@) 
= © / wrt leds | gle gt. 
0 0 
Using the integral representation (14.4), the inequality above provides the desired 


result (16.34). On the other hand, using (14.34) and 


T(p—-—k)T k 
Bip — km +k) = POT 


we can easily deduce that (16.34a) follows from (16.34). This proves the result. 


The following corollary is of interest. 


Corollary. Let p > 0 and q € R such that |q| < p. Then 

T(p) <T(p—4)l(p +4) 
and 

B(p, p) < B(p— 4, P+ 4). 
Proof. In Theorem 16.18, choose m = p and k = q. Then 

k(p —m—k) =—q* <0, 

and by (16.34) we get the first inequality, 

(p) <T(p— 4) +4). 


The second inequality follows from it. 


Definition. The positive real numbers a and b may be called similarly (oppositely) 
unitary if 


(16.35) (a —1)(b—1) = (SNS) 0. 


Theorem. Let a,b > 0 and be similarly (oppositely) unitary. Then 


(16.36) T(a +b) => (<) abl (@Tb) 
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and 
1 
(16.36a) B(a,b) = (<) —, 
ab 
respectively. 
Proof. Consider the mappings f, g,4 : Ry, — Ry given by 
fOQ=t71, eth=te!, and hi) =te™. 


If the condition (16.35) holds, then obviously the mappings f and g are synchronous 
(asynchronous) on R, and by Chebychev’s integral inequality we can state that 


[o.@) lo) [o.@) [o.@) 
| revtar | pee Late = © / reat | tredt 
0 0 0 0 
provided (a — 1)(b — 1) = (<) 0; ie., 


(16.37) PQ)ra+b)> (S)P@+bDret+d. 


Since [(a + 1) = al (a), T(b+ 1) = DI (b), and [(2) = 1, (16.37) becomes 
(16.36). The inequality (16.36a) follows by (16.37). This proves the theorem. 


The following corollaries may be noted as well. 
Corollary. The mapping log T(x) is superadditive for x > 1. 
Proof. If a, b € [1, 00), then, by (16.36), 

log l(a +b) = loga + logb + log T'(a) + log T'(b) = log l(a) + log (db), 
which is the superadditivity of the desired mapping. 


Corollary. For every n € Z’,, n > 1, anda > 0, we have the inequality 


(16.38) T(na) > (n— 1)!a?"- YT (a)]". 


Proof. Using the inequality (16.36) successively, we can state that 
T(2a) >a7T(a)0 (a), PGa) > 2a? (2a) (a), P4a) > 3a7P Ba)T (a), 
..., (na) > (n= Na7T[( — la]E (a). 
By multiplying these inequalities, we arrive at (16.38). 
Corollary. For any a > 0, we have 


92a-1 iT 
T < Tr —]}. 
@= Jaa (« 7 ;) 
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Proof. Using (14.13) since [(2a) > a?I'?(a), we arrive at 
1 
2b (@\r (« & ;) > /na*T*(a), 


which is the desired inequality. 


For a given m > 0, consider the mapping T’,, : Ry > R, 


T(a+m 
Iin(x) = — 

The following result holds. 

Theorem 16.19. [228]. The mapping Tn is supermultiplicative on [0, 00). 


Proof. Consider the mappings f(t) = t* and g(t) = t”, which are monotonically 
nondecreasing on [0, 00), and A(t) := t’”~!e~' is nonnegative on [0, 00). 

Applying Chebychev’s inequality for the synchronous mappings f, g and the 
weight function 1, we can write 


ioe) ioe) oe) love) 
fl lear | petytm—lo-t ay > | rem letar f prm-le-tag. 
0 0 0 0) 


That is, 
Tim)Tat+y+m)>=lat+m)r(y +m), 


which is equivalent to 
Pin(® + y) = Pn @)PnQ), 


and the result is proved. 


16.9.2 Inequalities via Hélder’s Inequality 


Let J) € R be an interval in R, and assume that f € Lpy(U2), g € Lg U2) (p > 
1, +=) ie, 


| lf (s)|?ds, | le(s)|2(ds) < oo. 
bh h 


Then fg € Li(/2) and the following inequality due to Hélder holds: 


7 
<(J iro as) (/ le(s)! as) . 


Using Holder’s inequality, some functional properties of the mappings gamma 
and beta are given under the following result. 


(16.39) / f(s)g(s)ds 
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Theorem 16.20. [228]. Leta,b > Owitha+b=1andx,y > 0. Then 
(16.40) Tax + by) < IP@)NMI Os 
i.e., the mapping T is logarithmically convex on R*,. 


Proof. We use the weighted version of Hélder’s inequality 


- ¢ Foh(oods)" (/, je)Ih(o)ds) 


= 1, h nonnegative on Jy, and provided all the other integrals exist 


(16.39a) / S(s)g(s)h(s)ds 


1 1 
for p = 1, rie 


and are finite. 
Choose 


f(s) = gee-), g(s)= s20-D and h(s)=e*, s €(0,0) 


in (16.39a) to get (for Jy = (0, 00) and p = 1, q= ) 


fore) 0° i a fore) : b 
| stl) : s29-De-Sds < (/ st0-Yte‘ds) (/ sH0-Whe-tds 
0 0 0 


which is clearly equivalent to 


lee) ee) a fore) b 
| gern leds 2 (/ tetas) (/ tetas) ‘ 
0 0 0 


and inequality (16.40) is proved. 


Remark. Consider the mapping g(x) := log I(x), x € Ri. We have 


_’@le)=F@)r 


= I’(x) and g(x) = rx 5 
Xx 


~ T(x) 


for x € (0, co). Using the inequality, we conclude that g” (x) > 0 forall x € (0, 00), 
which shows that I is logarithmically convex on (0, 00). 


g(x) 


We now prove a similar result for the beta function. 


Theorem 16.20a. [228]. The mapping f is logarithmically convex on (0, 00)? as a 
function of two variables. 


Proof. Let (p,q), (m,n) € (0, 00)? anda,b > 0 witha + b = 1. We have 
Bla(p, gq) + b(m,n)] = B(ap + bm, ag + bn) 


1 
ay pera _ prereel ge 
0 


1 
ai pee OG _ ra ie dae 
0 


-[ [eta net] x feta — n'Y at 
0 
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Define the mappings 
a 
fo=[rta-o-'], teh, 
b 
g(t) = tage = 2 | , £ET, 


and choose p = 4, q= t (Gtfsatb= 1, p> 1). 
Applying Hoélder’s inequality for these selections, we get 


1 a b 
p-l = q-1 pol t= q-1 d 
Lf (1 —1f) | E (1 —f) | t 
1 “ 1 p 
ai mor at| «| mana] : 
0 0 


Bla(p, q) + b(m, n)] < [B(p, g) "18m, n) 1? 


which is the logarithmic convexity of £ on (0, 00). This proves the result. 


1.€., 


Closely associated with the derivative of the gamma function is the logarithmic- 
derivative function 


d I’ 
V(x) = qx OBE) = — x £0,—-1,- 


The function ‘¥ (x) is also commonly called the psi function. 


Remark. The digamma function is monotonically nondecreasing and concave on 
(0, 00). 
16.10 Simpson’s Inequality and Applications 


Introduction 


The following inequality is well known in the literature as Simpson’s inequality (see 


[228]), 
Gare a1 2 (*)] 
(16.41) 2 
1 | ea ue 
= 3880 = IF |. <2 


where the mapping f : [a,b] — Ris assumed to be four times continuously differ- 
entiable on the interval (a, b) and the fourth derivative is assumed to be bounded on 
(a, b); that is, 


|| = sup fo < ©. 
&9 x€(a,b) 
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Now, if we assume that J, : a = x9 < X1 < +++ <Xn-1 < Xn = bisa partition of 
the interval [a, b] and f is as above, then we have the classical Simpson quadrature 
formula 


b 
(16.42) / FOE = As TI LRG TD, 


where As(f, In) is the Simpson rule 


n—1 


1 
As(f. In) = = DULf Oa) + FON + 5 2S (Se), 


i=0 i=0 


and the remainder term Rs(f, In) satisfies the estimate 


1 n—-1 

(4) 5 

Rs(fJn)l = aan | F |. 20% 
i= 


where hj := xj41 — x; fori =0,...,n —1. 
When we have an equidistant partitioning of [a, b] given by 
b— 
In i Xj = at ee £05 ving BS 
n 


then we have the formula 


where 


b 
(16.42a) / fOIde As Ce Rea). 
h=a n—-1 
Asnl(f) = bh E (« + 
ut i=0 


-—a . b-a_. 
i) +t (ar-=*-¢40)| 
n—-1 


ome a —a a). 


and the remainder satisfies the estimation 


(b= a) | 2) 

I, 

We point out some very recent developments on Simpson’s inequality for which 
the remainder is expressed in terms of derivatives lower than the fourth. 

It is well known that if the mapping f is not four times differentiable and the 
fourth derivative f is not bounded on (a,b), then we cannot apply the classical 
Simpson quadrature formula, which actually is one of the quadrature formulas used 
most often in practical applications. 


R Sy 
IRsn(f)| < 7880 


Simpson’s Inequality for Mappings of Bounded Variation 
The following result holds. 
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Result 16.21. [228]. Let f : [a,b] — R be a mapping of bounded variation on 
[a, b]. Then we have the inequality 
(16.41a) 


(yar P54. [LOFLO 57 (22%) 


2 


I b 
<76-a\/(P), 


b bes ; 1 
where \/ ,(f) denotes the total variation of f on the interval [a, b]. The constant 5 
is the best possible. 


Corollary. Suppose f : [a,b] > R is a continuous differentiable mapping with 


b 
inh =f | f’(x)|dx < oo. 


Then the following inequality holds: 
(16. a) 


FOTIe) .. a+b 1 
[roa-? ad ee : +2f(S*)]|< sl fhe ay. 


16.11 Applications for Special Means 


Let us recall the following means: 


e the arithmetic mean: 


A(a, b) := , a,b=0; 
e the geometric mean: 
G(a,b):=~Vab, a,b>0; 


e the harmonic mean: 


2 
H(a,b):=7—;, 4,b>0 
atb 
e the logarithmic mean: 
b-a 
€(a, b) := ——, a,b>0, ab, €(a,a) =a; 
Inb—Ina 


e the identric mean: 


1 
1 Ba 
1a,b) = ~(=) , a,b>0, ab; and 
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e the p-logarithmic mean: 


pet! — qptl 
ec e No—a) 


It is well known that €,(a, b) is monotonically nondecreasing over p € R with 
€_; := € and €9 := €(a, b). In particular, we have the inequalities 


H<G<t< (a,b) < A(a,b). 


L 
Pp 
, peR\{-1,0}, a,b>0,a4¢b. 


Using Result 16.21, some new inequalities are derived for the means above. 
Let f : [a,b] > RO <a <b), f@)=x?, p € R\{-1, 0}. Then 


1 
— [ f(x)dx = €,(a,b), 


FO) + 10) _ gear. o, 
f(S*) = ara 


and 


-1 
If/ll = |pI@— ae j,  p € R\{-1,0, ]}. 
Using inequality —_ we get 


P| pp 


e3 (a,b) — NG P bP) — =AP (a b) a Pb a)’. 


The following result holds. 


Result 16.22. [228]. Let f : [a,b] — R be an L-Lipschitzian mapping on [a, b]. 
Then we have the inequality 


b 
| foyax 958 [LORI 2p (244)]) <3 


—l(a, b)(b-— a). 


~ 36 


Applications for Special Means 


Using Result 16.22, we now point out some new inequalities for the special means 
defined earlier. 


(i) Let f :[a,b] > RO <a <b), f(x) =x?, p € R\{-1, 0}. Then 


pb? if p >1, 


IF lloo = dp, b) := le if p € (—oo, 1)\{—1, 0}. 


Then we get 


1 2 5 
€5(a, b) — zAta?, b?) - 34° (a,b) a 35 07 (4, bY —a). 
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(ii) Let f : [a,b] > RO <a <b), f(x) = 4. Then 


IIflloo = 
a? 
Then we get 


3H (a, b)A(a, b) — €(a, b)A(a, b) — 2€(a, b)H (a, b) 


5 —a 
< Dp’ a2 €(a, b)A(a, b)H (a, b). 


Simpson’s Inequality in Terms of the p-Norm 
The following result holds. 


Result 16.23. [228]. Let f : [a,b] — R be an absolutely continuous mapping on 
la, b] whose derivative belongs to L p[a, b]. Then we have the inequality 


b 
| ed 27 ()] 


= 


1| 297141 
6 | 3(¢4+1) 


1 
q L 
(b—a)'TSIf'llp, 


where + +2=1, p>. 


16.12 Inequalities from Information Theory 


The well-known Shannon entropy (SE) 
n 
H,(P) = — > pi log pi 
i=1 
satisfies the Shannon inequality 
n n 
(16.43) — > pelog pe < — >- pr log ae, 


where (Px); (qk) eT. 
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Indeed, logx < x — 1 forx > 0. 
y=x-l1 


log x 


Put x = “~, k = 1 ton so that 


Pk? 
n dk dk 
> pe: log = <> px (#- i) = 0, 
= Pk Pk 


which is (16.43). The Shannon inequality plays an important role in coding theory, 
where it provides the best lower bound for the average code length. 
Also, from directed divergence (dd), 


n 
Dk 
Dn(PI|O) = > pelog * > 0 
k=1 dk 


follows (16.43). The inequality (16.43) above can be expressed in the functional 
form 


(16.44) > ref (Pr) = >- ref qu): 


k=1 k=1 


where f : Io > R, (px), (gk) € re. 

There are several interpretations of the Shannon functional inequality (16.44). 
One of the interpretations is connected with the problem “how to keep the expert 
honest” [15, 22]. I.J. Good considered the problem of how a firm can encourage its 
forecaster to give fair and accurate estimates of probabilities. In general, the problem 
is complicated, so he considers only a simple case. Suppose, for instance, an expert is 
asked to estimate a certain probability distribution regarding the occurrence of some 
events (outcome of an experiment, market situation, weather, etc.). He gives this as 
(41, 92;-+->9n), While his (subjective) probabilities for the occurrence of the same 
events are (p1, P2,.--, Pn). If he agrees to be paid only after the occurrence of the 
event (market situation, etc.) is known and his payoff will be (ideally) f (gx) if the kth 
event happens, then his expected earnings will be }°7_, px f (gx). In order to keep 
the expert honest, it is natural to choose the payoff function f so that the expert’s 
expected earnings will be maximized if he has given his probabilities. Hence (16.44) 
holds. Inequality (16.44) can also be given a nice interpretation in connection with 
the reproducing scoring system. 
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The inequality (16.44) was extensively studied in [15, 46, 61, 62, 43, 273, 641, 
505] assuming various regularity conditions on the unknown function. The general 
solution of (16.44) without assuming any regularity condition is 


(16.45) f(p)=alogp+b (a>=>0, p €]0,1[. 


Several proofs are known. We will present the proof given in [15] and treat general- 
izations of (16.44) later. 


Solution of (16.44) from Aczél [15] 


The proof is done showing first that f is monotonically increasing and then that f is 
differentiable. 

Put in (16.44) pp = gx (kK = 3,4,...52), PL=P, N=, pPitp2=r= 
gi +4q2, r € ]0, I[, so that it goes over to 


(16.46) PYF (P= FE)= C— DGC =—M= 710 =P) 


for p,q € ]0,r[, r € JO, I[. 
Interchanging p and q in (16.46), we get 


(16.46a) qi (qg)= fo) = r= Qyrer =p) =F = q)). 


Multiply (16.46) by (r — g), (16.46a) by (r — p), and add to obtain 


rig— pif @)— fp) =. 


Ifg > p, then f(g) > f(p); that is, f is monotonically nondecreasing. 
From (16.46a) and (16.46), for g > p, we have 


fe p)— fra) = ~F@)— FO) 


— i — Ee = p= 70 =a): 
P= q Pp 


IA 


that is, 


r-a(fr—p)-fr—-a) _ f@— fl) 
q q=p —  g-—p 
r—p(fr-p)-fr Q)) 


P q—P 


Suppose now that f is differentiable at r — p in ]O, I[. As g — p, the first and last 
terms of the inequality tend to 


Pra) 
P 
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and then we conclude that f is differentiable at p also and 


(16.47) pf'(p)=(r— p)f'(r— p) for p €)0, 1[, r €]p, 11. 


Now we claim that f is differentiable everywhere on ]0, 1[. Indeed, if f were not 
differentiable at po € ]0, 1[, then f is not differentiable at r — po (r € ]po,i[); that 
is, f would not be differentiable on the nondegenerate interval ]0, 1 — po[. But this 
would contradict the fact that f is a monotonic function that is almost everywhere 
differentiable. Thus f is differentiable everywhere, and (16.47) shows that pf’(p) = 
0 (constant a > O since f is nondecreasing). Thus we obtain (16.45). This concludes 
the proof. 


16.12.1 Generalization 


The inequality 


n n 


(16.44a) > pr felge) < >> pe fe(Pr) 


k=1 k=1 


is a generalization of (16.44). The general solution of (16.44a) was given in [46] 
without assuming any regularity conditions on f,, and it was shown that the solutions 
Sx of the inequality (16.44a) are of the form 


(16.45a) Sk(p) =clogyp+bk, pelo, k=1,2,...,n, n > 2, 


where c > 0, bx are arbitrary constants. The inequality 


(16.44b) DY sof av) < >) spo f (Pw, 


k=1 k=1 


a generalization of (16.44), was treated in [274], and some variations of (16.44) were 
investigated in [277]. Now we determine the general solution of the inequality 


(16.44c) > gk(Pr) fe (qe) < > se(Pe) fe (Pr): 


k=1 k=1 


when it holds forn > 2, without assuming any regularity conditions on the functions 
Sx. The inequalities (16.44), (16.44a), and (16.44b) are special cases of (16.44c). 
We will provide an application of this to find the solution of an inequality connected 
with the generalized problem of “how to keep the inset expert honest” [22]. 


Solution of the Inequality (16.44c) 


We will prove the following theorem adopting the method found in [46]. 
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Theorem 16.24. (Kannappan and Sahoo [505]). Let fx : [0, 1| > R be real-valued 
functions and gx : \0, 1| + R*, be positive and strictly increasing continuous func- 
tions for every k (k = 1,2,...,n). For all P,Q € AQ. suppose fx and gx satisfy 
the functional inequality (16.44c) for arbitrary but fixed n > 3. Then each fx is a 
monotonically increasing function and differentiable everywhere. Further, there exist 
constants c > 0, a € JO, 1[, b1, b2,..., bn such that 


P dt 
(16.45c) fle) =e | ——+bhy (k=1,2,...,n). 
a gx(t) 


The converse is also true. 
Proof. Let pi) = g, gi = q, and p; = qi for alli > 2 in (16.44c). Then, after 


simplification, we get 


sip) filg)+ ar — pf —4@) < gilp)filp) + a2" — p) far — Pp), 


where p+ p2 =r =q+q2, forr € JO, 1[. The inequality above is equivalent to 


(16.48) si(pyLfi(p) — filg)] = g2 — PL —9@)— far — p)] 


for all p,g € JO, r[ andr e€ ]0, 1[. The domain on which (16.48) holds is symmetric 
in p and q. Thus, interchanging p and q in the above, we obtain 


(16.48a) sill fig) — fi(P)] = a2 - OLA — p)- fpr — 4). 


Since go is positive, multiplying (16.48) by go(r — qg) and (16.48a) by go(r — p) and 
adding, we get 


(16.49) (gi(p)g2(r — g) — gi(q)gor — p)(fi(p) — fig) = 9. 


Let us choose p and q in JO, r[ such that p > q. Since g; and g2 are strictly increas- 
ing, g1(p)g2(r — g) — g1(q)g2(r — p) > 0. Hence, from (16.49) we conclude that, 
forr € ]0,1[, p,g €]0,r[ whenever p > g fi(p) => fi(q). That is, f; is increasing 
on the interval ]0, 1[. Similarly, f, is monotonically increasing on Jo. 

Also from (16.48) and (16.48a), one obtains 


gr — p) A ~g— fr —p) _ fille) - fA@ 
sip) @=-@=—-C=-p) ~ pa 
28 =O) BYH=M=20=P) 
gi@)  G—-@g)-¢—p) ~ 
Thus, from (16.50), if fo is differentiable at r — p, then f) is differentiable at p. 
Hence (as g —> p) 


(16.51) gi(p) fi (Pp) = 820 — p) far — p). 


In other words, if f; is not differentiable at p, then /f2 is not differentiable at any 
r — p € ]0,1 — pl. But this is impossible: Since fo is monotonically increasing, 


(16.50) 
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f2 is almost everywhere differentiable. Therefore, f| is everywhere differentiable 
and (16.51) holds for all p € ]0,r[. Since r is arbitrary in Jo, it is easy to see that 
(16.51) holds for all p € Jo. Similarly, f2 and all f, (k > 2) are differentiable. Since 
(16.51) is true for every r in Jo, letting s = r — p in (16.51), we get 


gi(P) fi (P) = g2(s) fa(s) = (constant). 


Since g1 is positive, this equation yields 


ees 
fil—)= Fo 


Pod 
file) =e | “pte 


where c, a, and }, are constants. Similarly, 


Thus 


P dt 
(16.45c) Sk(p) =o} — + bx, 
a_ 8k (t) 
where c, a, and by are constants fork = 1,2,...,n. Since each f/f; is increasing 
c > 0 and since p € I, therefore a > 0. This completes the “only if” part of the 


theorem. 
What follows next is the proof of the “if” part of the theorem. Consider 


> eon ROS Saino ie. at (using (16.450) 


k=1 
_ Pk . he 8kK(PK) 4, 
= Z [ g(t) ore ye a Gy 


Pr>qk” TE Pk <4k 
2 > pee 8k(Pk) aed pe 8K(PK) ae 
Pk> qk” Tk BK) Pk <qk* Pk 8K) 


Since gx; is monotonically increasing, gx(px) => gx(t) for py => t > qx and gx(px) < 
gk(t) for py < t < qx. Thus, since c > 0, one obtains 


Pk 


> se(Pol fe(Pe) — felgl = f) dt =0. 


k=1 kal Uk 


Thus (16.45c) satisfies (16.44c). This completes the proof of the theorem. 


Remark (i). If g is strictly decreasing and does not change sign, then Theorem 16.24 
is also valid (with c < 0). 


Corollary (i). [46]. The inequality 


(16.44d) > pr felge) < >) pe fe(Pr) 


k=1 k=1 
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holds for fixed n > 3 and for all P,Q € r if, and only if, there exist constants 
c>0, bi, b2,..., bn such that 


(16.45d) fi (p) = clog p + by. 


Furthermore, the right-hand side of inequality (16.44d) is equal to 
n n 
k=1 k=1 


Proof. Letting g;(p) = p and using (16.45c), one obtains (16.45d). This completes 
the proof of the corollary. 


Remark (ii). Since in (16.44c) it is required that g;,’s be positive and strictly increas- 
ing continuous functions only, several choices are available for g;’s. Suppose we let 
gx(p) = p? and the g,’s are strictly increasing for B > 1 and strictly decreasing 
for B < 1. Also gxg(p) > 0 for all p in Jo. Thus, the right-hand side of the inequal- 
ity (16.44c) for f = fx (K = 1,2,...,n) becomes c + dH} (P), where c, d are 
constants and 


n 
(10.4) Hf (P) = /@'4 - 1) bs Ph - i} 
k=1 
the entropy of degree / (giving an inequality for the entropy of degree /). 


16.12.2 Application to Mixed Theory of Information 


In this section, we provide an application of Theorem 16.24 to determine the general 
solution of the inequality 


(16.52) > ge(Pa) fe (Ges Xe) < >> 8K (Pu) fie(Pes XK).- 


k=1 k=1 
The inequality 
n n 
D> Pr fees Xt) < > Pe fe (De, XK) 
k=1 k=1 
arises in connection with the generalized problem of “how to keep the (inset) expert 


honest” (Aczél [15, 22]). Inequality (16.52) is a generalization of the inequality 
above. Consider the randomized systems of events 


X1,%X2, se »Xn 

P15 P25+++5Pn : 
where pj; is the probability associated with the event x; (@ = 1,2,...,n; n > 2). 
The events x; are considered as pairwise disjoint elements of a ring of subsets 6 of 
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some given set Q. Letn > 3 and D? = {(41,X2,...,Xn) | x7 € B—{O}, x;Nxj; =O 
fori 4 j; i, 7 =1,2,...,n, andQ = (i_, x;}. We assume throughout this section 
that B is not an algebra (that is, B does not contain LJ. x). 

We now prove the following theorem. 


Theorem 16.25. (Kannappan and Sahoo [505]). Let gx : Ip — R be positive and 
strictly increasing continuous functions and fx : Ip x (6B — {0}) > R be real- 
valued functions fork = 1,2,...,n (n = 3). Then fy and gx satisfy (16.52) for all 
(P1, P2,--+5 Pn) and (41, 92,--+59n) € A° and all (x1,X2,...,Xn) € DD if, and 
only if, there exist constants b > 0, a € Io, and functions yx : (B — {0}) > R(k = 
1,2,...,n) such that 


Pk dt 
(16.52a) Sk (Pk Xk) = b | —— + yx (xx) 
a g(t) 
(XK € B— {0}, Pk € ]0, 1[, k= 1,2,...,7). 
Proof. Temporarily fixing x, in (16.52) and using Theorem 16.24, one obtains 
P dt 
(16.53) Si (Ds Xk) = bf ae ae 
a g(t) 


where b, a, and yx depend on the event x, € 6 — {0}. As in (16.45c), a and a are 
the same for all f; (k = 1,2,...,) and thus in (16.53), and thus we get 


(16.54) b(x1) = d(x), i =1,2,...,n, 
and 
(16.54a) a(xj) =a(xyj), 1=1,2,...,n, 
where (x1,.X2,...,Xn) € p>, 

Now we shall show that b and a are constants. Let us consider two nonzero 
arbitrary elements x;, y; in B, each contained in partitions (x1, x2,...,%n) and 
(91, 2,---» Yn) of Q belonging to De. Without loss of generality, we may assume 


x, Ay, 4 0. If x1 Oy, = O, then since (x1, x2,...,Xn) and (91, y2,..., ¥n) are 
both the partitions of Q, there exists an ig such that xj,  y; 4 0. By renaming x;, 
as x; and x; as xj,, we obtain a partition satisfying xj M y; # O. Next, we shall 
find two partitions of Q having one set in common. It is not difficult to see that 
(QapNyi)Axj =0, (41 — y1) Ux2) Nx; = 0 for all 7 > 3 and 1 Ny) (x1 — 
y,) Ux2) = 0 with (x; — y1) Uxe, (v1 — 41) U y2, x1 :M y; are in B — {0}. Thus, 
(x1 AN y1, @1 — y1) U x2, %3,...5 Xn) and (41 N y1, (V1 — X1) U yo, ¥3,.--5 Yn) also 
belong to Dp? Now, using (16.54a), we get 


b(x1 Ny1) = b(x3) = ++» = bn) = (3) = ++ = bY n) = (X11) = 11). 


Thus b is a constant. Similarly, by (16.54a), a is found to be a constant. Hence we 
obtain (16.52a). This completes the proof of the theorem. 


The following corollaries are obvious from Theorem 16.25. 


646 16 General Inequalities 


Corollary (ii). [22]. Suppose B is not an algebra. The inequality 


n n 


> PEF ks XK) <>) Pf (Pee XK) 


k=1 k=1 
holds for a fixed n > 3 and for all (x1, x2,...,Xn), (V1, Y23-++5 Yn) € Ao if, and 
only if, there exists a constant c > 0 and a function y : B — {0} > R such that 
Ff (Pk Xk) = clog pe + y (xx) 


for all x, € B — {0}, px € JO, 1[, k =1,2,...,n. Thus, the right-hand side of the 
inequality reduces to 


n n 
(16.55) © >) pe log pe + >) pey (x). 
k=1 k=1 


Remark (iii). The first term in (16.55) is a constant multiple of the Shannon entropy, 
and the second term is the sum of the expected value of a random variable [15, 22]. 
In a way, (16.55) has the form of an inset entropy. 


Corollary (iii). Suppose B is not an algebra. The inequality 


»2 ve f dk. xk) < SY PEF (Pes XK) 


k=1 k=1 


holds for a fixedn > 3, 6 > 1, and for all (x1, x2,...,Xn) € p®: (P1, P2,-++5 Pn)> 
(41, 92,-+-»dn) € Ao if, and only if, there exist constants d, c, and a function 
y : B — {0} > R such that 


d = 
P (Pin X8) = GPK? +7 GH) +e 


for all x; € B — {0}, px € JO, 1[. Thus the right-hand side of the inequality reduces 
to 


n 
(16.56) a+ bHh(P) +>. ply (ee), 
k=1 
where H/(P) is the entropy of degree f and a, b are constants. 
Remark (iv). Note that, in a way, (16.56) has the form of an inset entropy of degree 
f (see [51]). 
Absolutely Continuous Solution of the Inequality (16.44e) 


Kullback-Leibler directed divergence D,(P||Q) defined by (dd) serves as a sepa- 
rability measure (Veglius, Janson, and Johansson [815]) between the distributions 
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P and Q. Itis frequently used in statistics (Kullback [570]), pattern recognition (Ton 
and Gonzales [805]), coding theory, signal processing, and analysis of questionnaire 
data. This measure locally behaves like the square of the distance (Kas [528]) 


2) 
d(01,02) = | Vi@ad, 
a 


where ./7(@) is the positive definite square root of the information matrix i(@), with 
@ being the parameter. Further, it is also connected with the y-square discrepancy 
measure and is the derivative of the Chernoff coefficient [151, Chapter 10] with 
respect toa ata = 1. 


Definition. A function 1, : T° > R is called a separability measure if and only if 
Ln(P||Q) = 0, “n(P||Q) attains the minimum if P = Q forall P,Q € re with 
n> 2. 


Definition. A separability measure “, : T o — Ris a distance measure of Kullback- 
Leibler type if there exists a function f : Jj — R such that 


Un(P||Q) = >> ref (Pe) — fe) 


k=1 
forall P,Q ET : with n > 2. The function f is called a generating function for the 
distance measure of Kullback-Leibler type. 


In view of these definitions, it is easy to see that the distance measures with the 
generating function f : Io — R satisfy (16.44) for P, Q € re (with n > 2). The 
most general inequality arising from (16.44) is (16.44c), studied in Theorem 16.24. 
Now we consider the absolutely continuous solution of (16.44c) holding forn = 2, 
namely 


(16.44e) gi@)fidy) + 820 — x) fx — y) < g1@) fil) + g20l — x) fa — x), 


for x, y € JO, If. 

Let gx : Io > Ri (k = 1,2) be positive, strictly increasing, and continuous, 
and fx : Io > R (k = 1, 2) be absolutely continuous functions, and these functions 
satisfy the functional inequality (16.44e). Rewriting this equation, we obtain 


(16.57) si@yfidy) — fi@)} S g201 — xf — x) - fl — y)} 
for all x, y € Jp. Interchanging x with y in the inequality above, we obtain 


(16.57a) riOUNG) = AO) s 20 = yi —y)— Ad =2)) 


Since go is positive, multiplying (16.57) by g2(1 — y) and (16.57a) by g2(1 — x) and 
then adding the resultant, we obtain 


(16.58) {gi(y)g2(1 — x) — gi(@)g201 — HA) — fiQ”)} < 0. 
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Since the functions g; and go are strictly increasing, if x > y, then 


gil(y)g2l — x) — gi@)g2 — y) <0 


and hence (16.58) yields f;(x) > f(y). That is, f; is an increasing function on Jp. 
Similarly, f2 is also an increasing function on Jo. 


Remark. Note that the monotonicity of /;’s follow from the monotonicity of g;’s. 
Here the absolute continuity of f;’s is not used. 


Also, from (16.57) and (16.57a), we get for x > y 


gl —x) h—y)— fll—=) _ A&)—NO) 
gi(x) x-—y ~ x—y 
get=)) Day) Hil 7x) 
~  gily) x—-y 


From this inequality, we conclude using the continuity of g; and go that if f5(1 — x) 
exists at 1 — x, then f i (x) exists at x; that is, if fo is differentiable at 1 — x, then so 
is f; at x. Since g; and go are strictly increasing continuous functions and since f; 
and f2 have derivatives almost everywhere, we get 


(16.59) gi(x) fi (x) = g2(1 — x) fy. — x) 
for almost all x € Jp and (refer to Natanson [644, p. 255]) 
x 1 
(16.60) fie) = fat | PP ya — ar 
a gi(t) 
and 
(16.60a) fo(x) = w(x), 


where y is an arbitrary absolutely continuous increasing function on Jp anda € Io 
is a constant. Thus we have proved the following theorem. 


Theorem 16.26. (Kannappan and Sahoo [507]). Let gx : Io > Ri (k = 1,2) be 
positive, strictly increasing, and continuous functions on Ip. Let fx : Ip > R(k = 
1, 2) be real-valued functions and satisfy (16.44e). Then f,’s are increasing. In 
addition, if f\ is differentiable at x, then fz is differentiable at (1 — x). Further, if 
Jx’s are absolutely continuous, then f;.’s satisfy (16.44e) if, and only if, fx’s have 
the form given in (16.60) and (16.60a). 


16.12.3 Continuous Solution of the Inequality (16.61) 


Now we find the continuous solution of (16.61), which is a special case of (16.44e) 
for gi(x) = x = go(x), by making use of a result in [641], 


(16.61) xfiy)+d—x)f2d — y) <xfi@)+ A —-x) fod — x). 
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Theorem 16.27. [507]. Let fi, fo : Io — R be real-valued continuous functions, 
and they satisfy (16.61) for all x, y € Ip. Then f\ and f2 are of the form 


1 


(16.6 1a) AQ) =(—-x)H (« = =) +f H(t)\dt+c1, x€Io, 
0 


and 


i 
2 


(16.61b) foa)=(@-lA (5 = a) +/ ‘ H(-t)dt+c2, x €lo, 
0 


where H : I\;2 = 0, al — R is an arbitrary continuous and increasing function 
and c\, C2 are arbitrary constants. 


Proof. Let f\, f2 : Io — R be continuous and satisfy (16.61). First of all, no- 
tice that if f; and f> are solutions of (16.61), so also are f; +a, and f2 + a2 for 
some arbitrary constants a; and az. Replacing x by x + 5 and y by y+ 5 in (16.61), 
we obtain 


(+) 0669-06) 


Gayi=s{n (Ge)+n (5-*)| 
and 
1 1 
H(x):= Fi (5+x)-2(5-)]. 


we obtain from the inequality above 
(16.62) G(x) — G(y) = —x{H@) - H(Y)} 


for all x, y € 11/2. From Theorem 16.26, we see that f; and fz are increasing. Thus 
H is also increasing on the interval /;/2. As f; and f2 are continuous, so are H and 
G on the interval J) /2. Interchanging x and y in (16.62), we have 


(16.62a) G(y) — G@&) = -y{H(y) — H@)}. 
From these, we obtain 


—x{H (x) — H(y)} < G(x) — G(y) < —y{H (x) — H(y)}. 
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If H(x) = A(y) forall x, y € [i/2, then G(x) = G(y), which in fact shows that G 
is identically a constant. Hence, from the definitions of G and H, we see that /; and 
f2 are constants. 

Suppose now that there exist x and y such that H(x) # H(y). Then, as in [641], 
if we define Q = {H(x) | x € 1/2} and ¢ : Q > R by ¢(@) = G(x), where x is 
such that H (x) = a, it follows that 


(16.63) G(x) = d(A(x)) = aH + H(t)dt +c, 
0 
where c is an arbitrary constant. Since 
fi (; +x) = G(x) + 5H) and fa (; — ) = G(x) - 5H), 


we get from (16.63) 


fi (5+2) = (5-») H+ [ woarse 
fr (5-») =- (54x) H+ [Hoare 


These can be rewritten as (16.61a) and (16.61b). This completes the proof of the 
theorem. 


and 


Corollary. The continuous solution of the functional inequality 
2 2 
> pf (Qe) < >. ref (Pe) 
k=1 ke] 


where P,Q € T°, is given by 


f@)=(A-x)H (s-s)+[0° H(t)dt+c, 


where H is a continuous, increasing, and odd function on the interval I, /2. 


Remark. The distance measures of Kullback-Leibler type on 1 with a continuous 
generating function are of the form 


2 1 I 
u2(P, QO) = >) pk {« — px)H (n = 5) —(1— 4%)H (« - )} 
k=1 
ss Pk- 
+> | _ Ade, 
k=1 


dk-3 


where H : 11/2 — Ris an arbitrary continuous, increasing, odd function. 
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16.13 Reproducing Scoring Systems 


A direct approach to measuring a subject’s knowledge concerning a test item is to 
attempt to elicit from him his subjective probability distribution over the options. 
We shall investigate scoring systems designed for this purpose. There are many 
techniques for assessing the present state of a subject’s knowledge. In the case of 
objective testing and programmed instruction techniques, most of the information 
necessary to assess a subject’s knowledge is contained in the subjective probabilities 
(of the subject) concerning the correctness of various possible answers. To measure 
these subjective probabilities, it is necessary to have a scoring system. In connection 
with weather forecasting, scoring systems were used long ago in [83]. For scoring 
systems and other related materials, [771] is a good source of references. Assume 
that a subject has a subjective probability distribution P = (pj, p2,..., Pn) € Tn 
over the n options of a particular item. Let Q = (q1,q2,.--,dn) € In be 
any probability distribution. A scoring system is a collection of n-place functions 
{fi, fo...» fn} for which fi (q1, 92, .--,n) is the score awarded to the subject for 
the ith option if he indicates that Q is his subjective probability distribution. The 
expected score of the subject from giving the distribution Q as his response to the 
item is 
n 
(16.64) En(P, Q) = >) pe fi (4, 425 +++ 4n)s 
k=1 
and the loss to the subject by giving Q as his subjective probability when it is actually 
P is 
Dy(P||Q) = En(P, P) — En(P, Q). 
The collection of scoring functions { f, fo,..., fn} is a reproducing scoring sys- 


tem (RSS) if it encourages the subject to report his true probabilities (see Weber [826, 
p. 269]). That is, (fi, fo,..., fn} is an RSS if 


(16.64a) Mees Q)=E,(P, P). 
€ n 


From (16.64) and (16.64a), we obtain the functional inequality 


n n 
(16.64b) patina. + <2sGn) = > pe felpi; Pos<:+s-Pr); 
k=1 k=1 
where P,Q € Ty. 

The solution of (16.64b) generates the scoring functions for the RSS for multiple- 
choice testing. The functional inequality above was solved in [735] for the two- 
option case (that is, form = 2,) with the assumptions that yi (x) := fi(x, 1—x) and 
w(x) := fo — x, x) are both differentiable and nondecreasing on [0, 1]. 

Consider the inequality 


(16.65) ge (pe) fe (G1. 425 -+-59n) < >. 8k(Pk) fe (PIs P25 --++ Pn)s 


k=1 k=1 
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where P, Q € c and gx, : J9 > R (kK = 1,2,...,n) are continuous monotonically 
increasing and positive functions. In what follows, we assume once and for all that 
gx’s have the properties above. The functional inequality (16.65) is a generalization 
of (16.64b). For the two-option case, (16.65) yields 

(16.65a) 


g1(pi) fii, 92) + 82(p2) f2(q1, 92) < gi(Pi) fi (Pi, p2) + 82(P2) f2(P1, P2)s 


where pj + p2 = | = qi + q2. Denote p; = p and q; = q in (16.65a) to obtain 
(16.66) gi(p)yi(q) + g2(1 — p)y2— 4) < gi(p)wi(p) + g21 — p)yal — p), 


where wi(p) = fi(p,1— p), y2(p) = fo — p, p), and p,g € Io. 

Now we investigate the functional inequality (16.66). First of all, we will show 
that every solution of (16.66) is monotonically increasing. Second, we will deter- 
mine the general solution of (16.66) under weaker regularity conditions on y;, than 
those assumed for solving (16.66) with g,(p) = p (k = 1,2) in [735]. Finally, we 
propose a generalization of the RSS. 


16.13.1 Solution of the Functional Inequality (16.66) 


Let gx : In > R (k = 1, 2) be positive, strictly increasing, and continuous, and let 
wr : Io > R (k = 1,2) be absolutely continuous functions (see Natanson [644, 
p. 243] and these functions satisfy the functional inequality (16.66). 

Rewriting this, we obtain 


(16.67) si(p)(wi(g) — wi(P)) S 920 — pP)(w2U — p) — wal — 4) 


for all p,g € Jo. Interchanging p with g in (16.67), the inequality becomes 


(16.67a) gil(g)(vi(p) — ywi(g)) S 820 — gal — 4g) — yo — p)). 


Since go is positive, multiplying (16.67) by g2(1 — q) and (16.67a) by g2(1 — p), 
and then adding the resultant, we obtain 


(16.67b) (g1(q)g2U. — p) — gi(p)g20 — 4) (i (Pp) — wi(q)) < 9. 


Since the functions g; and go are strictly increasing, if p > qg, then gi(qg)go01 — 
P)— gi(p)g2(1 — gq) < 0 and hence (16.67) yields wi(p) => wi(q). That is, y is an 
increasing function on Jo. Similarly, y2 is also an increasing function on Jp. 


Remark (i). Note that the monotonicity of y;’s follows from the monotonicity of 
gx s. Here the absolute continuity of y,x’s is not used. 
Also from (16.67) and (16.67a), we get for p > q 


8201 — p) wel — 9) — wl p) —§ mi) — 1@) 
gi(p) pq ~ p=@ 
— 20-4) yl —g) — we — ay 
ei(q) p-q 
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From the inequality above, we conclude using the continuity of g1, g2 that if yi(1 — 
p) exists at 1 — p, then y}(p) exists at p; that is, if y2 is differentiable at 1 — p, 
then so is yy; at p. Since g; and g2 are continuous and monotone functions have 
derivatives almost everywhere, we have 


gi(p) vi (Pp) = g2(1 — p)yw50 — p) 


for almost all p € Jo and (see [644, p. 255]) 


P _ 
(16.68) n@)=n@+ / gl) via = nat 
a gi(t) 
and 
(16.68a) wo(p) = w(p), 


where y is an arbitrary absolutely continuous and increasing function on Jp, and 
a € Ig is aconstant. Thus we have proved the following theorem. 


Theorem 16.28. Let gx’s (k = 1, 2) be positive, strictly increasing, and continuous 
functions on Ip. Let we : Ip > R (k = 1,2) be real-valued functions and satisfy 
(16.66). Then wx’s are monotonically increasing. If yw, is differentiable at p, then 
y2 is differentiable at 1 — p. Suppose wx’s are absolutely continuous. Then wx’s 
satisfy (16.66) if, and only if, y's have the form given in (16.68) and (16.68a). 


Remark (ii). If y,’s are differentiable, then solutions of (16.66) can be obtained 
from (16.68) and (16.68a). Assuming g,;’s to be the identity function, Sholander 
[735] found the differentiable solutions of (16.64b) for n = 2. Some of the conse- 
quences of Theorem 16.28 are given below. 


Corollary (i). [f yi(p) = filp, 1 — p) and y2(p) = f2U1 — p, p) are absolutely 
continuous and satisfy (16.64b) for n = 2 with g1(p) = g2(p) = Pp, then fx’s are of 
the form 


Pj 
filp,1-p)= viay+ | (- 7 :) vd —bdt 


and 
fo(p,1— p)= yw - p), 


where w(x) is an arbitrary but monotonically increasing and absolutely continuous 
function on Ip and a € Ip is an arbitrary constant. 


Example (i). Let gi(p) = g2(p) = p in (16.66). Further, let wo(p) = —5(1 — py’ 
be a monotonically increasing function on Jp. Then, from (16.68), we get yi(p) = 
—5(1 — p)*. Therefore, the scoring functions for the RSS are f\(p, 1— p) = fol— 
Pp, p) =—4(1— p)’. 


Example (ii). Let gj(p) = p and g2(p) = p? in (16.66). Assume that y2(p) = 
—35(1 — p). Then, by (16.68), we get 
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P (1 —t) 
in(p) = vita) + | Ca at. 


a 


: : : 1—p)3 
cue the scoring functions are given by fi(p, 1—p) = ak fi and fa(p, |—p) = 
5 p. 
Corollary (ii). Let gx’s be positive, strictly increasing, and continuous functions on 


Ip. Let wi(p) = fi(p,1— p) and wo(p) = f2U — p, p) be absolutely continuous 
and satisfy (16.66). If fi = fo = f, then fi(p, 1 — p) =c (constant) for almost all 


Pp €1o. 


Proof. Now f; = f2 on Jo implies y1(p) = w2(1 — p) on Jo. Thus, from (16.67b), 


we obtain (g1(p) + g2(1 — p))w}(p) = 0. Since gi(p) + g2(1 — p) > 0, we have 
wv; (p) = 0. Hence y;(p) is a constant for almost all p € Jo. 


Example (iii). For g1(p) = g2(p) = p and fi = fo = f, then one of the solutions 
of (16.66) is 2p — p (cf. Weber [826]). 


Example (iv). If we do not impose the condition of absolute continuity on wx’s, 
then there are solutions of the functional inequality (16.66) that are not of the form 
(16.68) and (16.68a). For example, if g1(p) = g2(p) = p and yi(t) = wo(t), then, 
for a positive constant c, the saltus function 


0, if0<p<5, 
fps io wp=z, 
2c, ifs <p<1, 


is a solution of (16.66). 


16.13.2 Symmetric Reproducing Scoring Systems 


A reproducing scoring system {f1, fo,..., fn} is said to be a symmetric reproducing 
scoring system (SRSS) if, and only if, 


fri) (P1, P25+++5 Pn) = fi (Px(1)> Pr(2)o+++5 Px(n)) 


for every permutation z of {1,2,...,}. For details regarding SRSS, refer to [735]. 

In the SRSS {fi(p1, p2,---, Pn) | i = 1,2,...,n}, the score awarded depends 
not only on the probability assigned to the correct option but also the distribution 
over the various incorrect options. An SRSS does not depend on the distribution of 
incorrect options if 


fr(i)(P1s P2+ +++» Pn) = f (Pri): 


A functional inequality related to such a special kind of SRSS is the inequality 
(16.44) (Shannon’s inequality). The general solution of (16.44) for n > 3 with no 
regularity assumptions on the unknown function f is given by (16.45). In [735] it 
was shown that, under the differentiability of the scoring function f, such a special 
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type of SRSS gives rise to a logarithmic scoring system. However, even without 
assuming any regularity condition on f, one may arrive at a logarithmic scoring 
system from (16.44) forn > 2. 

Sholander [735] has discussed the construction of an RSS in which the subject’s 
score depends only on the probability assigned to the correct answer and not on 
the probabilities assigned to the incorrect answers. For such a scoring system, the 
scoring functions would have the property that 


Ik; P2; <4 Dn) = hx (px) 


for all P € T°. Assuming hx’s to be continuous over the closed unit interval and 
continuously differentiable over its interior, it was shown in [735] that h;’s have the 
form 

hk(p)=alogp+bhky, pe€lo, 


where a, bj, b2,..., bn are arbitrary constants. In this RSS, the loss to the subject 
when he gives Q as his subjective probability when it is actually P is (dd) D,(P||Q). 
The aforementioned type of RSS with the special scoring functions hx satisfies the 
functional inequality (16.44). 


16.13.3 A Generalization of RSS 


Consider that a subject has a_ subjective probability distribution P = 


(P1, P2,--+5Pn) € ie over the n options of a particular item. Let Q ¢€ es 
be any probability distribution. Let {fi, fo,..., fr} be a scoring system and 
Fi(G1, 92,--+;9n) be the score awarded to the subject for the ith option if he in- 


dicates that Q is his subjective probability distribution. In such a scoring system, 
the score awarded depends not only on the probability of the correct option but also 
on the probabilities of the incorrect options. To counterbalance the effect of the 
probabilities of the incorrect options, we award g;(pi) fi(q1, 92, ---, Qn) as his score 
instead of fi(q1,q2,.--,9n). The function g;(p) can be regarded as the weighting 
function depending only on the true probability of the ith option. Then we define the 
total weighted score of the subject from giving the distribution Q as his response to 
the item as 


n 
Ta(P, O) = >) gk (Pr) fe (G1, 92 +--+ 4). 
k=1 
A collection of scoring functions {/1, f2,..., fn} is a generalized reproducing 
scoring system if, and only if, 


Max Tn(P, Q) = Tr(P, P). 
QcT9 
It is easy to see that the reproducing scoring system is a special case of the gen- 
eralized reproducing scoring system since 


Tr(P, Q) = En(P, Q) 
if gg(p) = p. The following is an example of a generalized RSS. 
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Example (v). Let g,(p) = p? (k = 1,2). Then the set of scoring functions 
tis fr; eae | Fa}, where 


Sk(P1s P2,+++> Pn) = 


1/2? 
4 
(S71 7’) 


defines a generalized RSS. To see this, notice that 


1/2 
n n 272 Us 

Dok=1 Pid 

>! PEARL 925 +++ Gn) = : 1 <(>o% 
k=l (7-144) 


n 
= >) pi fe(P1, prs --+s Pn). 


k=1 


This inequality is obtained via Holder’s inequality. Equality in the inequality above 
holds if, and only if, gx = apg (k = 1,2,...,m), where a is a constant. 

Since (p1, p2,---, Pn) and (q1, q2,.--,9n) are in re, a = 1, and hence, for 
Per 34 pei, 42, ++++Qn) iS maximized when P = Q. 

A special case of the generalized RSS that may be of some interest is the one that 
has the property that fi; (p1, p2,---, Pn) = hk(px) (K = 1,2,...,n) forall P € ro. 
Such a system gives rise to the functional inequality (16.44c). 


16.14 More Inequalities from Information 
Theory 


We are concerned with the functional inequality 


(16.69) ye oe 2 1, 


where P,Q ¢ T°, fi : Io > R* (that is, f;(p) 4 0) holding for fixed n > 2, 
connected with Rényi’s entropy. 
Result 16.29. (Kardos [526]). Let f; : Ig > R* @ = 1, 2) satisfy the inequality 
r—p 
ph) Sips pyLe=P) P) 25 
fi) for — 4) 


for all p,q € \0,r| for fixedr € Ip. If the f; do not change signs, then each of the 
following holds: 


(16.69a) 


(i) fj is monotonically decreasing (increasing) on 0, r{ if fj is positive (negative) 
on Io. 
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(ii) p — pfi(p) is increasing (decreasing) on ]0, r[ if fj is positive (negative) on 
Io. 

(iii) fj is locally absolutely continuous on ]0, r[. 

(iv) If fi is differentiable at p, then f is differentiable at r — p and the following 
relation is valid: 


pil (p) _ fh =p) 
ip) wep) 
All solutions f; that do not change signs on Ip (i = 1, 2) of the inequality 
fil) iol =p) 
16.70 = 1, O 1,0 1 
(16.70) Poa) Pye —@) Qe ee ene Se 


are of the form 


POA = 
ftp) =aexp( [PEE Par), pip) = be). peo 


where a, b, and c are arbitrary, ab 4 0, c € Ip, with g arbitrarily continuous, 
positive, and decreasing, and p — pg(p) increasing on Ip. 


Corollary. All solutions of (16.70) when f2 = fi = f are of the form 


ftv) =aexp( | Oa). pélo, 


where a #0, b € I with G arbitrarily measurable on Io and satisfying, for almost 
all p € Ip, GU — p) = G(p) and —-1 < G(p) <0. 


Result 16.30. [526]. /f fi; @ = 1,2,...,n) do not change signs, then the general 
solution to (16.69) for fixed n > 3 has hedoits 


fi(p) = bi p*, i=1,2,...,n, 


where —1 <a <Oandb; > 0(< 0) if fi > 0(< 0). 


Next in line is the inequality 


(16.71) ye oe 4, 


for all P, Q € T°, holding for some integer n > 2. 


n? 
Result 16.31. (Fischer [276]). Let f : Io > Ri, satisfy (16.71). If (16.71) holds 


for a fixed n > 3 and if f is monotonic, then f is differentiable and has the form 
ft (p) = ap*, where a > O. and either c < —lorc > 0. 
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From the measure (dd) D,(P||Q) and other considerations, we studied (16.44). 
We considered symmetric distance measures in Chapter 10. From this symmetric 
divergence or J-divergence 


Mn(P||Q) = Dn(PI|Q) + Dn(QIlP) 


n 
= >) pilog pi + gi log gi — pi log qi — ai log pi 


i=1 
there results the functional inequality 


n n 


(16.72) Vif (ei) +a FQ = “wifa) +a f wil 


i=1 i=1 


for P, Q € T° (see [275]). 


16.15 Inequalities from Inner Product Spaces 


In Skof [752] it is shown that if f : X — Rand X a real linear space satisfies the 
equations 


(16.73) If +y)| = |2f@) + 2f0) — f@ —y)I, 
(16.73a) If@ — yl = |2f@)+2f() — f@+y)I, 
(16.73b) If@+y)+ f@ —y)—2f@)| =21fO), 
(16.73c) If@+y)— f@ —y)—2fO) =21F @)I, 
(16.73d) If +y)+ f@ —y) = l2f0)+2fO)1, 


for x, y € X, then f satisfies the Jordan-von Neumann parallelogram equation; that 
is, the quadratic equation (Q). 


Remarks 
i) f :R— R given by 
cx"; 


x > 
> = c #0, 


cx*, x <0, 


ro =| 


satisfies the inequality version of (16.73a), 


I2f(x)+2f0)- fa+y)I <If@-y)I, 


but not the quadratic equation (Q). 
(ii) f : R > R defined by f(x) = cx", c #0, ne Zi (> 1), x € R satisfies 
the inequalities 
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I2f@)| <If@+y)+ f@—y)—-2fO), 
I2FO) SIF@+y)+ f&—y)—2F@)L, 
I2f()+2fO) SIf@+y)+ f@—y), 


for x, y € R, but not (Q). 
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The situation changes if we consider the inequality arising from (16.73), namely 


(16.74) 20) + 2f0) — Fay I< IF @y)IL 


for x, y € Ga group. It is proved thatif f : G —> X satisfies (16.74), where G is 
a group, X is an i.p.s., and G is a 2-divisible Abelian group or is 2-divisible and f 


satisfies (K), then f satisfies (Q). 


Here we present the proof from [695] and prove the following theorem. 


Theorem 16.32. (Ratz [695]). Let G be a group and X ani.p.s. over a field K (= R 


or C), and f : G > X satisfies the inequality (16.74), 


(16.741) f (xy) = fx), 

and 

(16.74ii) f(xyxy—!) = f(x), forx,y eG. 
Then 

(16.75) 7O=0; 20H=f6 Ww fO)=47@), 


x € G, and f is quadratic (Q). 


Proof. x = ¢ in (16.74) gives f(e) = 0, and e the identity in G. y = x~! in (16.74) 


implies 
(16.76) 2f (x) +2f(«7!) = fx’) forx eG. 
Let y = x in (16.74) to get 

(16.77) All fF@)I <I F@)|| forx € G. 


Now 


4 fC) < FGI < 2F GOI +2 fF @7 DI 


by (16.76) or 
IFN < FQ, 


and in turn < || f(x)||; that is, 


(16.77a) If@)I = NFA) 
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and 


(16.77b) If) = 4 £1. 


Now 
WAG) — f@7 I? = FDU? + FO DI? -— FG) + FO DI? 


(parallelogram identity in an i.p.s.) 


| 
= All f(x)II? - glF@r (by (16.77a) and (16.76)) 


=4ll f(x)? — 4 f@)I? (by (16.77b)) 
= (0): 


Thus f(x) = f(x!) and f (x7) = 4f (x). This proves (16.75). 


Replace x by xy, and y by xy? in (16.74), and use (16.75), (16.741), and 
(16.7411) to obtain 


I2f ay) + 2f Gy!) —4FO)I < 4F@OIL 


Squaring both sides, we obtain 


2 f xy) +2 (xy)I7+16]] f OI? -2Re(2 f ry) +2f (xy~'), 4f)) < 161 f @)II?. 


Interchange x and y and use f even from (16.75) to get 


2 f xy) +2f (ey )I7-+16]] f I? -2Re(2 f xy) +2f (xy), 4f (x) < 161 fF OI. 


Adding, we have 


(16.78) f(xy) + f(xy)? < 2Re( f(xy) + fey), fF) + f(y) 
<2 fy) + fey Dil If @)+ FOI 
(Cauchy-Schwarz inequality) 


or 
If@y)+ fay) < 21f@) + ODI 


Letu,v € G. Putx =uv, y= uv—', and use (16.74), (16.741), and (16.74ii) in the 
above to have 


4 fu) +4fO)Il < 21 f(uo) + fo )IL, 


so that 


(16.79) fy) + Foy") = 21 f@)+ FOI 
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Now 


If ey) + f@y7!) — 2f@) -—2F ODI? 
= If (xy) + fay DI? 
+All f(x) + fO)IP — 2Re( fry) + fy!) 2f@) + 20) 
= 2If (xy) + f@ey7'DI? 
— 2Re(f (xy) + f(xy7!),2F() + 2f()) by (16.79) 
<0 (by (16.78)). 


Thus f satisfies (Q). This completes the proof of this theorem. 


By some modification of parallelogram identity: 


(i) |lu + v||? + |lu — oll? ~ 4, where ~ denotes either < or > for ||u|| = |lo|| = 1 
[214]. 
(ii) For ||u|| = 1 = |lo||, there are a, 8 4 0 such that 


llau + Bo||?||Bu — avl|* ~ 2(a? + p*) 
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16.16.1 More on Convex Functions 


Let H be a convex open nonempty subset of IR” and 6, = {T C R” : every convex 
function bounded above on T is continuous}. Which subsets belong to B,? Every 
nonempty open subset T of R” is in ¢,. Let C C R” be an open connected set 
(n > 2). If f : C > Ris a continuous function such that gr f (graph of f) contains 
(n + 1) affinely independent points, then gr f € B, (Jablanski [387]). 


Suppose f : C — R is convex (16.10) (concave), where C C X (a real linear 
space) is a convex set. Then, for x1,%2,...,%) € C, Pi,---,Pn € Re, Po = 
+1 Pi > O, the following inequalities hold [227]: 


(i) 


1 nN 1 n 
(FY rm) sOES nso 
i=l " j=] 
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(ii) 
1X 1 ‘ , 
(gL) srg 2 Pi," Pigs F pai 
ick 
<Op 2) Pir Pah 5 ty 2{s).1 2) 
j=l 


1 Xi, +X; 
es 7 Pi, Dio f (ae) 


Py i, ,i2=1 
1 n 
Si > pif &i). 
” i=1 
where k is a positive integer such that | <k <n—1. 


(iii) Let f : C — R be aconvex mapping, xj € C, p; => 0, and P, > 0. Then the 
inequality 


I we Ie 
B, DPF) - (= do) 


1 n 1 1 k 
oD, Pif Gi) — oe >» Pi, ++ Pin f pa 
” j=1 1 iyi j=l 


1 n 


Es PE > oo * Pix Ere) 
He 


Vv 


at 
~ PE >» oe * Pix (pds 20 


holds for every positive integer k such that 1 <k <n. 


Let f : C CR — R be aconvex function (16.10) on an interval C anda,b € C 
with a < b. The double inequality 


a+b fla)+ £©) 
5() <P pear < HOF 


is known in the literature as Hadamard’s inequalities. The inequalities are due to 
C. Hermite, who discovered them in 1883, ten years before J. Hadamard [333]. 

The following refinement of the first inequality holds [226]. Let f be as above 
and a, f : [a,b] — R + be two continuous mappings so that a(x) + f(y) > 0 for 
all x, y € [a, b]. Then one has the inequalities 
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a+b x+y 
r( 2 rere al fs a ) avay 
a(x)x + B(y)y 
esol [3 5|/ (or ) 


pee) a d 
+f mye 


b 
< —/ fOrdx. 


For a given convex mapping (16.10) f : [a,b] > R: 


(1) Let H : I > R be defined by 


b 
H(t):= —/| f (1 +(1 =F") ax. 


The following hold: 


(i) A is convex on Io. 


a b 
(i) sup,eo,yH® =H) = cy J? f@)dx. 
(iii) H is monotonically nondecreasing on J. 


(II) Let F : J > R define 
l b pb 
F(t) := eal | f(tx + (1 —t)y)dxdy. 
The following hold: 
(i) F (s +4) =F ($-5) forall s in [0, 3]. 


(ii) F is convex on J. 
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(iii) F is monotonically nonincreasing on [0. 3| and nondecreasing on [+. i]. 


(iv) One has the inequality 


H(t) < F(t) forallte J. 


16.16.2 Inequalities for Integrals 


Inequalities of interest may be obtained by integrating, under appropriate conditions, 
known inequalities for functions. We consider explicitly two such inequalities that 


seem to be both simple and useful and are suggestive of further generalizations. 


Let f(x) be a nonnegative function of a real variable x, and let S be a set of real 


numbers such that the integral 


I = [p= [ireotar 
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taken over an appropriate range (a, b), with b > a, exists when k € S. Then 


(16.80) pare fr a aaa 
2 
(16.81) Liye > ie gor ; 


where m, n, and s(m +n) belong to the set S. 

Inequality (16.80) may be proved by starting with the arithmetic mean—geometric 
mean inequality, and (16.81) is a particular case of the Schwarz inequality. Both of 
these may also be independently derived by using properties of convex functions 
[252, 215]. 

Two particular cases are worthy of note. 


(i) Writing 6 = b—a = range of integration, and taking m = 1, n = —1, we have 


1 
16.81 +/ 2 08. 
(16.81a) / f+ fs 


wom (8 


(ii) When m = 2, n = 0, (16.81) gives 


wenn fr fra(fiy 


A geometrical representation of this may be obtained as follows. Let PQ be 
the arc of the curve y = f(x) between x = a and x = b. Let s be the area 
bounded by the arc PQ, the ordinates at P and Q, and the x-axis; and let V be 
the volume of the solid of revolution formed by rotating the area A about the 
x-axis. Then inequality (16.81c) is equivalent to 


Vo= ms’. 


By taking different forms of the function f, we should be able to obtain inequal- 
ities of interest. 
For example: 


(i) Let f(t) =e’, a = —co, b = 0, m, and n positive numbers. From (16.80), 
1 1 2 
—_> 


+-= : 
m n m+n 


(ii) Let f(t) =e', a =—1, b =1, andm,n any real numbers. Inequality (16.80) 
gives 
sinhm = sinhn _ 2sinh(m+n)/2 
ee ee eee 


m no m+2 
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(iii) Let f(t) =te‘', a=0, b=. From (16.81), 


T(m+1)P (+1) 7 mitiyntl 


(16.82) (r (mn 4 yy = (aye 


(iv) Let f(t) = VU — k2 sin? t), a=0, b=x. Then (16.81a) and (16.81b) give 


(x, k) + F(x, k) = 2x, 
C(x, k)F (x, k) > x?, 


where @ and F are elliptic integrals of the first and second kinds. 


As a slightly more general form of (16.80) and (16.81), we have 


[ert [ er" a gfe, 
( / ef”) ( / sf") > ( i afm) 


where f is as before and g(x) is a nonnegative function of x. If f(t) = t, a = 
0, b=, the latter gives 


2 
T+) +1) > [r(“+ +1)| 


This may be compared with (16.82) above. 
A variety of inequalities may be obtained from further generalizations. 


16.16.3 Cauchy-Schwarz-Holder Inequalities 


Given a positive integer n and real numbers az, by (K = 1,...,), Cauchy’s in- 
equality (Cauchy [147, pp. 373-374, Theorem XVI)) states that 


n 2 n n 
(16.83) (> ats) 2 63 “) (> 2) ; 
k=1 k=1 k=1 


Milne made the interesting discovery that if no ay + be is 0, the expression 


n n 
(Sets) Setgree-ep 
1 1 
lies between the left- and right-hand sides of (16.83). Callebaut [140] has shown that 
the function 


a R(x) = (aor) 


1 1 
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with all a,, bx positive is increasing with |x|. This function with r = 1 yields 
the left-hand side of (16.83) when x = 0 and the right-hand side when x = 1. 
In [254] it was observed that R(x) is convex (16.10) with a minimum at x = 0. 
Callebaut’s result follows immediately from this observation. Similarly, construct- 
ing appropriate convex functions, [255, 215, 216, 217] gave new proofs of a num- 
ber of classical inequalities and numerous generalizations of the Cauchy and Calle- 
baut inequalities. Among these generalizations was the following [217]. Let 
a1,42,...,4n, bj, b2,...,bn, p, ¢ be positive numbers with (1/p) + (1/q) = 1. 
Then the function 


n I/p n 1/q 
Hi) = (> ote Ogee) (> apse) 


1 1 


is convex (16.10) with a minimum at x = 0. Consequently, H(O) < H(1), which 


yields 
n n 1/p n 1/q 
Yan =(Let) (La) 
1 


1 1 


Holder’s inequality. 


Generalizations of Cauchy’s Inequality 


We make use of the simple observation that for K > 0 the function K* (and hence 
also K~*) is convex (16.10). We also note that the sum of convex functions is convex 
and the product of a convex function by a positive constant is convex. 

Let Ap, “Mk, pe, and qx (kK = 1,...,n) be positive numbers. Then the function 


T(x) = (> mai \(> aut) + (> nis") (> au’) 
1 1 1 1 


is convex with a minimum at x = 0. It is a constant if, and only if, 4; = 42 =--- = 
-1 -1 -1 
An= Hy = hy = =H - 


Result 16.33. (Eliezer and Mond [255]). Let ag, bx, cz, and dy (k = 1,...,n) be 
positive numbers and let rj, 5; (i = 1,2, 3,4) be real. Then the function 


n n 
ry +8 xX prot+s2Xx 13+83X 7r4+84Xx 
Ti(x) = (> ay by Ie eo", ) 


1 1 


n n 
11 —S{X 1 rg—SAX 13—S3X 7F4—S4X 
+(e \(reproraye™) 
1 1 


is convex with a minimum at x = 0. It is a constant if and only if, fork = 
12. 23.9% abe — ae ils = a, where «a. is independent of k. 
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Generalization of Hélder’s Inequality 


Let 2p, Uk, Pes Gk (kK = 1,...,), p,andq be positive numbers. Then the function 


n 1/p n 1/q n 1/q n 1/p 
s)=(Sas") (S mut") +(Saa) (Sma) 
1 1 J; 1 


is convex (16.10). If 


Se) EJB)" 


n n 
> a=) = 1). 
1 


1 


(ce. if 


then S(x) has a minimum at x = 0. 


Result 16.34. [255]. Let ax, be, ce, dy (kK = 1,...,), p, and q be positive 
numbers. Then the function 


1+(p/q)x,1—x )1/P 1+(q/p) lx ) 1/4 
a b Cc d 
Si (x) = | k k > k k 


> acby > cds 
¥ al-*plta/p)s 1/q Yih xg t/a HP 
kk kk 
ian kel ed atk Fk 
> abe Di cKdk 


is convex with a minimum at x = 0. 
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Applications 


Applications in binomial expansion, scalar products, economics, the Cobb-Douglas 
production function and quasilinearity, interest formulas, physics, Gaussian func- 
tions, Chebyshev polynomials, determinants, geometry, statistics, information the- 
ory, and sums of powers are considered in this chapter. 

Functional equations is a growing and fascinating part of mathematics because 
of its intrinsic mathematical beauty as well as its applications. Applications of func- 
tional equations abound in mathematics and elsewhere in diverse fields. This chapter 
is devoted to applications of functional equations in various fields. Applications have 
contributed to the growth and development of functional equations. Applications 
can be found in many fields—physics, economics, behavioural and social sciences, 
cognitive science, group theory, information theory, fuzzy set theory, artificial intel- 
ligence, iteration, dynamical systems, psychometry, nondifferentiable functions, etc. 
The list is getting longer by the day. It is nearly impossible to provide an informative 
survey of all the works on the various applications of functional equations. We have 
seen some in Chapter 1, primarily devoted to Cauchy and Pexider equations, and 
more in various other chapters. In what follows, I present some results or examples 
chosen from many fields in which functional equations play an important role. We 
will illustrate this with many interesting examples arising from geometry, physics, 
information theory, economics, statistics, topology, and taxation. 

Functional equations have contributed greatly to the development of appropriate 
methods to handle and solve many problems (see the general references by Aczél 
[59, (i),Gi)], [12], Aczél and Dhombres [44], and Castillo and Ruiz-Cobo [144]). 


17.1 Binomial Expansion 


In Lacroix [592], Newton’s binomial theorem 


(17.1) (l+x)"=1+ A@sx + APW@x’+-- =>) fpWx” 


n=0 


Pl. Kannappan, Functional Equations and Inequalities with Applications, 669 
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with fo(u) = 1, x,u € R, f, : R — R was treated, and it was pointed out that 
fi(u) determines the other functions f,(u), k > 2. Cauchy [147] used functional 
equations to obtain Newton’s binomial theorem 


=! —~1)---(u—-n—1 

(17.2) +x)" = Lux MOO D4. ee a ele ee 
! n! 

Without going into the intricacies of convergence, etc. (which are fundamental for 

(17.2)), we obtain (17.2) from (17.1) by applying Cauchy’s additive equation (A). 

Replacing u by (u + v) in (17.1) and (1 + x)“*? = (14+ x)"-(1+ x)", u,v ER, 

we get the following: 


(i) By comparing the coefficient of x, we have 
fiut+o)= fiu)t+ fi@), foru,o € R; 


say fi (u) = A,(u), where A is a solution of (A). 
(ii) By comparing the coefficients of x*, we obtain 


fou t+ ov) = fo) + foe) + Ai)A1(0-) 


1 
= fou) + PO) + 5IAiw + v)? — Ay(u)* — Ai(v)7); 


that is, 
A2(u + v) = Agu) + Aro), 
where 
Ap(u) = falu) — 5Arlu)? 
or 


fo(u) = A2(u) + 5A’, foru eR. 
(iii) One more comparison of the coefficients of x? results in 
But+v) = pw+ BoO)+ hWhAe)+ hoAW 

1 

= f3(u) + f3(v) + Eo + Antu? | Ai(v) 
a Eo te Avo? | Ai(u) 

1 
= fru) + fav) + SLA @)"A1@) + A1O)A1@)] 

+ A2g(u)Ai(v) + A2(v)Ai(u) 


1 1 1 
= fw) + fe) + cA t+ vo) - ZAiuy - ZAilo) 
+ Ai(u+v)A2(u + v) — Ai (u)A2(u) — Aj (v)A2(0); 
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that is, 
A3(u+v) = A3(u)+ A3(v0), foru,v € R, 
where 
Aa(u) = fala) — ZA) — Aru Ar(u) 
or 


AOS Aw FAi(u)? + Ai(u)A2(u). 


Assuming the regularity conditions on fj, /2, 3 and noting that f| (1) 
1, fol) =0, fa(1) =Oand Ai(1) = 1, Ar) = 4, A3(1) = 8 +5 


—4, we have 
fi) =u, 
1 1 1 
frou) = aa + a = zulu — 1), 
(u) = xu + S08 — Su? = Luu — Nw —2) 
Bw) = zu ze he eee u : 


resulting in (17.2) (compare Chapter 1—Sums of Powers). Here the additive 
equation is used. 


17.2 Scalar or Dot Product 


Characterization of the scalar product is by means of the following properties [144]: 


(i) The scalar product is invariant when the two vectors undergo the same rotation, 
(i1) 
(cx): y =c@-y)=x- (cy), 
where x and y are vectors and c is a scalar, 
(iii) 
(e1 + e€2)-e3 =e, -€3 +e2- 63, 
where e1, €2, and e3 are unit and coplanar vectors. 


Because of the property (i), the scalar product can be written as 
x-y=h(@,|xI, ly), 


where /: is a function to be determined; that is, the scalar product of two vectors x 
and y depends on the moduli |x| and |y| of the vectors and their angle 0. 
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Condition (ii) gives 


AG, |x|, ly) = lxllylA@, 1, D = Iellylf@), 


where f (@) is a function to be determined. 
From condition (iii), taking into account that the vector sum of two unit and 
coplanar vectors making an angle @ is a vector with modulus 2 cos(@/2), we can 


write 
200s (“S*) ¢(*) = flu)+ f(r), 


where, as indicated in the figure below, u and v are angles associated with the pairs 
(e1, €3) and (e2, e3), respectively. 


e3 


Making the change of variable 
u=t+s, v=t-—s, 


leads to 


2cossf(t)= f(t+s)+ f(t —s), 


which is the same equation as (3.14). Thus its general solution is 
f(t)=acost+bsint, teER. 


If we perform a rotation of an angle z around the axis e2, we transform the pair 
(e1, €2) into (€;, e2) and, because of property (i), we have 


e1-e2 = f(A) =e) -e2 = f(—8) 
=> acost +bsint = acos(—t) + bsin(—t) 
>b=0. 


Consequently, x - y = |x||ylacos@. 
Here a trigonometric equation is used. The vector or cross product can be char- 
acterized similarly (see [144]). 
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17.3 Economics 


17.3.1 Duopoly Model 


This example deals with two firms, F; and F, that compete in the market. Natu- 
rally the sale function is a function of the prices p and advertising expenditures . 
Suppose S(p1, p2,41, 42), 8 : R4 — R, gives the sales of firm F; as a function of 
pi and p2 and 4; and 22, where p; and pz are the prices and A; and /2 are adver- 
tising expenditures of the firms F and F2. To determine S, some natural (practical) 
assumptions are made. We deal with two models based on different assumptions. 


Assumption 1. S(p1, p2, 41, 42) is increasing in p2 and /;. It is natural to assume 
that an increase in the unit price p2 of F2 increases the sales of F}. It is also natural 
to assume that an increase in the advertising expenditures 4; of F| increases its sales. 


Assumption 2. $(p1, p2, 41, 42) is decreasing in p;. An increase in the price p,; of 
F{ causes a decrease in its sales. 


Assumption 3. The S(p1, p2, 41, 42) function is continuous, at least at a point, in 
all its arguments. 


Remark. Note that we do not assume that $(p1, p2, 41, 22) is decreasing with res- 
pect to A2 since the advertising effort of one company can also benefit its opponent. 


17.3.1.1 Duopoly Model I 


Assumption 4. An additive change in prices and advertising expenditures leads to 
an increment in sales that is the sum of the independent contributions of prices and 
advertising expenditures. More precisely, 


S(pi + 91,p2 + 92,41 + m1, 22 + 2) 


(17.3) 
= S(p1, p2, 41,42) + U(1, G2, Pi, p2) + V(r, M2, A1, A2) 


for P1415 P25 492541, H1, 42, 2 € Ry. 

Equation (17.3) expresses that when changing prices and advertising expendi- 
tures, the resulting sales S(p1 + q1, p2 + 92,41 + 41,42 + m2) are the old sales 
S(p1, p2, 41, 42) plus two increments: U (q1, 42, Pi, p2), which depends only on 
prices, and V (21, “2, 21, 42), which depends only on advertising expenditures. This 
means that prices and advertising act independently (i.e., without interactions). 

The general solution of functional equation (17.3) is 


S(p1, p2, 41,42) = a(pi, p2) + BAL, 42), 
(17.4) U(q1, 92; Pi, P2) = (pi + 41, p2 + 42) — a(P1, p2) — 4, 
Vii, #2, 41,22) =at BAL + mi, 424+ br) — B(A1, 42), 


where a(p1, p2) and f(A, 42) are arbitrary functions and a is an arbitrary constant. 
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Interchanging 2; and wy and A2 and «2 in (17.3) and using (17.3), we obtain 


S(P1, P2, M1, #2) + U1, 92, Pi, p2) + VAL, A2, M1, #2) 
= S(p1, p2, 41, 42) + U(q1, 92, Pi, p2) + V(H1, M2, 41, 42). 
With “1 = 0 = wz, this equation yields $ in (17.4). Substitution of this value of S 
in (17.3) gives 
a(pi +41; p2 +42) + BAL + 1,42 + b2) 
= a(pi, p2) + BU, 22) + U(G1, 42, Pi, p2) + V(u1, 2, A1, 42). 
Separating p, q’s and 1, w’s, we obtain the U and V in (17.4). 

Consequently, all functions S(p1, p2, 41, 42) satisfying Assumption 4 have the 
structure in (17.3) (Le., prices and advertising expenditures act independently), not 
only for the increments of sales but for the sales themselves. The fact that we get 
two arbitrary functions means that Assumption 4 is insufficient to determine the full 
structure of S. Thus we can make new assumptions, such as, for example, assuming 
that the U function is independent of p; and p2 and the V function is independent of 


A, and A2. With this, functional equation (17.3) transforms to the following assump- 
tion. 


Assumption 4a. 


S(pit+aqi, p2+q2,41 + pi, 42 + 2) 


(17.3a) 
= S(p1, p2, 41, 42) + U (qi, q2) + Va, b2). 


Its general solution is 


S(p1, p2, 41,42) =a + bpi +cp2+djd, + d2d2, 
(17.4a) U(q1,92) = —a+bqi t+ cq, 
V(41, H2) =at+dimi +dzpM02, 


where a, b, c, dj, and dz are arbitrary constants, which, according to Assump- 
tions 1, 2, and 3, must satisfy the inequality constraints c,d > 0 andb < 0. 
Interchanging p; and q, and p2 and qz in (17.3a), we get 


S(q1,925 41,42) + U(p1, p2) + V(u1, 2) 
= S(p1, pr, 41,42) + U(q1, 42) + V(e1, #2); 
that is, 
S(p1, p2, 41, 42) — U(p1, p2) = S(q1, 92, A1, 42) — U1, 92) 
=r(Aj,A2) (say). 
Substituting this value of S in (17.3a) yields 


U(pitai, pot+42)+r(At mi, 42+ “2) 
= U(pt, p2) +r (Ar, A2) + U (qi, 2) + V (eI, be2); 
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that is, 


U(pi+ai, p2 +42) = U(p1, p2) + U(q1, q2) +4, 
r(Ay + wi, 42 + M2) =r (Aq, A2) + Vue, 2) — 


where a is a constant. Note that V and r differ by a constant. 
Hence we get (see Chapter 1) 


U(p1, p2) = —a + Aj (pi) + A2(p2), 
r(Ay, A2) = A3(A1) + Ag(2), 


and 


Vii, M2) = +a + A3(“1) + Aa(ue2), 


and so 


S(p1, p2, 41,42) =a t+ A(p1) + A2(p2) + A3(ue1) + Ag(u2), 


where A; (i = | to 4) satisfy (A). 

By Assumption 1, we get Ao(p) = cop, A3(u) = c3u; by Assumption 2, 
Ai(p) =c, p; and by Assumption 3, A4(“) = cay. Thus we obtain (17.4a). 

It is worth mentioning that each of the variables pj, p2, 41, and A2 act indepen- 
dently, which is obvious from (17.4a) (one summand for each) but was not from its 
equivalent condition (17.3), where they were grouped in couples. 

Note that the linear structure of S(p1, p2, 01,02) is a direct consequence of 
(17.3a). The arbitrary constants a, b, c, d,, and d2 are called parameters. Thus we 
obtain a parametric family of solutions that is an extension of the model proposed in 
[302]. 


17.3.1.2 Duopoly Model II 


In this model, we make a slight modification of the previous model, assuming multi- 
plicative changes in advertising. 


Assumption 5. An additive change in prices and a multiplicative change in adver- 
tising expenditures leads to an increment in sales that is the sum of the independent 
contributions of prices and advertising expenditures. More precisely, 


S(pr +i, p2 + 92,21 M1, A2b"2) 


(17.5) 
= S(p1, p2, 41,42) +U (G1, 92, Pis p2) + VOCAL, 42, HI, L2). 


The general solution of (17.5), which can be obtained in a similar way as the 
solution of (17.3), is 
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S(p1, p2, 41, 42) = a(p1, p2) + BUA, A2), 

(17.6) U(41, 92; Pi, P2) = (pi t+ 41, p2 + 92) — a(p1, p2) — 4, 
V(A1, 42, M1, H2) =4+ BUI M1, A242) — B(u1, #2), 


where a(p1, p2) and f (01, v2) are arbitrary functions and a is an arbitrary constant. 

As before, interchanging p; and q; and p2 and q2, we have S in (17.6). Sub- 
stitution of this S in (17.5) yields U and V in (17.6). It is interesting to see that 
the S functions in (17.3) and (17.5) coincide, showing that Assumptions 4 and 5 are 
equivalent. 

If now we assume that the U function is independent of p; and p2 and the V 
function is independent of 2; and /2, as before, functional equation (17.5) transforms 
to the following assumption. 


Assumption 5a. 
(17.5a) 
S(pi +41, p2 + 42, Afi, A2¢2) = S(P1, Pr, HA, #2) + U (qi, q2) + V(t, 42), 


whose general solution is 


S(p1, p2, 41, 42) =at+ bp, +cp2 +d log 4, + dz log 2, 
(17.6a) U(p1, p2) = —at+ bpi +cp2, 
V(Aj, 42) =a4+d, log 21 + dz log 22, 


where a, b, c, dj, and d2 are arbitrary constants, which, according to Assump- 
tions 1, 2, and 3, must satisfy the inequality constraints c,d > 0 and b < 0. Once 
again, P1, p2, 01, and v2 act independently. 
Interchanging p; and q; and A; and yw; (i = 1, 2) separately in (17.5a) results in 
S(p1, p2, #1, #2) = U(pi, p2) + r(H1, M2) (say) 
and 
r(H1, #2) — V(u1, #2) = constant = cy. 
This value of S in (17.5a) gives 
U(pi + p2, 91 + 42) = U(p1, p2) + U (41, 42) + €2; 


that is, 


U(p1, p2) = ¢3 + Al (pi) + A2(p2) 


and 


r(Ai ui, A2M2) = (41, M2) + V(A1, Az) + constant, 
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meaning 


V(A1, A2) — r(A1, 42) = constant, 
VA HI, A2u2) = V(A1, A2) + Vue, 2) + €4, 
V(Aq, 42) = e5 + L(A) + L242), 


where A; (i = 1,2) are additive and L; (i = 1,2) are logarithmic. Assumptions | 
to 3 give 
Ai(p)=bip, Li) =djloga, i=1,2, 


yielding (17.6a). 

Equation (17.6a) shows that S is more sensitive to prices than to advertising 
expenditures. However, this is not a surprise since in (17.5a) we required a larger 
effort in advertising (multiplicative) than in price (additive) for the same increment 
(additive) in sales. This assumption led to (17.6a) as the only solution and to the 
appearance of the logarithm function, which has not been arbitrarly selected but 
arises as a consequence of the assumptions. 

Now we have to explain why (17.3a) and (17.5a) have the same solution for S. 
The reason is what makes the difference in the independence on p; and p2 of the U 
function and the independence of 4; and 2 of the V function, which was assumed 
only for equations (17.3a) and (17.5a). In other words, the different forms of the U 
and V functions in solutions (17.4) and (17.6) cover for the differences in functional 
equations (17.4) and (17.5) without these differences being apparent through the 
S(P1, P2, 01, v2) function. 


17.3.2. Cobb-Douglas (CD) Production Function 
and Quasilinearity 


17.3.2.1 Quasilinearity 


Let D; C R” bea domain. A function ® : Dj — R is called quasilinear (Aczél 
[12, p. 154]) (see also Chapter 15) if there exist real constants a}, a2,...,@,, b with 
a\a2-+-d, # O and a continuous and strictly monotonic function f with inverse 
ia = , both with suitable domains, such that 


(QL) = D(x, x2, ..- Xn) = fo Mar f (x1) + ao f (x2) + --- tan f en) +0). 


Ifb =0, ay = a2 =--- = ay, = 1, (QL) becomes the so-called quasiaddition 


@(x1,%2,...,%n) = FEF 1) + fra) +++ +f On) 


If b = 0, a1 > 0,a2 > 0,...,an > 0, D> a, = 1, (QL) becomes the so-called 
quasilinear mean. If f(x) = x, the word quasi can be deleted in the preceding 
definition. 

It is well known that a linearly homogeneous production function ® : R4_ x 
IR — R, with CES (constant elasticity of substitution), a, is a CD (Cobb and 
Douglas [167]) production function 
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(CD) @(x1, x2) =r 8, 
c> 0, a,a@ € Ri, with a; + a2 = 1, foro = 1, or an ACMS (Arrow, Chenery, 
Minhas, and Solow [79]) production function 
= Sa jpee il 
(ACMS) (x1, x2) = (Bix, ” + Boxy”) ?, 


p>-l, p #0, fi, fo > 0, foro = 1/1 +p). 
Note that, if c = 1, the function value (CD) is a mean value of order 1, ag if 
fi + fo = 1, the function value (ACMS) is a mean value of order —p = 1 — 


In what follows. let us call a function ® : R4_x---xR*4 (n times (n > 2)) eH oR, 
a CD-type function if 
(17.7) (x1, X2,..-,Xn) Sie, Ry ore 


c > 0, anda; +---+a@, = 1, and an ACMS-type function if 


2 = apeudl 
(17.7a) O(x1,.02,...,4n) = (Bix, ? + Boxy? +--+ + Bax, ?) ? 


Bi > 0, c #0. 

Several characteristics of the CD-type functions, by means of functional equa- 
tions, can be found in Eichhorn [250]. 

Production functions of type (17.7a) with p > —1, fi > 0, fo > 0,..., hn > 0 
have been characterized via partial elasticities of substitution. 

Now a joint characterization of the CD-type and the ACMS-type functions by 
means of a property that differs completely from CES is given. This property is the 
so-called quasilinearity of (17.7) and (17.7a). It will be shown that these are the only 
systems of functions that are at the same time linearly homogeneous and quasilinear. 
In passing, it may be noted that differentiability assumptions are not needed. 

Note that both the CD-type functions and the ACMS-type functions are quasilin- 
ear: 

f@)Sloex, f' WSS. 
is CD (17.7); that is, 


@(x1, x2, arate Xn) = exp(a log x1 = a2 log x1 + a se an log Xn + log C), 
fase. J WSs 


b = O and gives ACMS (17.7a). 
Moreover, both the CD-type functions and the ACMS-type functions are linearly 
homogeneous. 


i 
p 
> 


17.3.2.2 Determination of all Quasilinear, Linearly 
Homogeneous Functions 


Theorem 17.1. (Eichhorn [249], Eichhorn and Kolm [251]). Let the function 
® : Rt — R be linearly homogeneous. It is a CD-type function (see (17.7)) or 
an ACMS-type function (see (17.7a)) if and only if it is quasilinear (QL) (see also 
Chapter 15). 
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Proof. The proof involves Cauchy equations, extensions, and the solution of (1.94b)). 
Let ® be a solution of (QL). Then, by linear homogeneity, we get 


Af [arf 1) +--+ t+ anf (Xn) +b] = flair f Axi) +--+ + anf xn) + 41, 


where 1 > Oand aj, a2,...,Gn, Db real constants with aja2---ay, 4 0. 
Set 


eQv= so), fa@) := f Gx), 
fy) =:uy v=1,2,...,n, Ay) = frag), 


in the equation above to have 
(17.8) hy(aquy + +++ + aptly +b) = ayhy(uy) +--+ + anhan) +4, 


2 > 0, a1, 42,...,4n, b real constants with aja2---a, 4 0, the solution of which 
is given in Lemma 17.2. hj is not necessarily defined on all of R. It is defined on 
the open set S = J; U1), where 1) = {u: u = f(x), x € R%4} is open in R (f is 
a continuous, strictly monotonic function, and I, = fajt+---tanh+b}={vo: 
D = au, +--+ aquyn +b, uj € 1)}). 


Lemma 17.2. [251]. For every fixed 2 > 0, each nonconstant continuous solution 
hy: S CR—- R of equation (17.8) can be written as 


fb, foruelh, 


17.8 hj = aju+ 
(17.8a) hy (u) = aju i pnwel. 


where a, #0, fi, and f, are suitably chosen real constants. 


From (17.8) with u = f(x), az = a(x), By = BA), 6, = f(A), and the 
definitions of g, {;, 42, we see an equation like (1.94b,), 


fQx)=aQA)f@)+ BQO), aQ) #0 forall 2. 
From Result 1.61, we have from (1.94b;) either 


f(x) = L(x) + e1 = g(x) = L(x) + €2, 
h(x) = a(x) = 3, 


or 
k(x) = B(x) = c3L (x) + c1 — €203; 
that is, f(x) = L(x) +b, a(v) = 1, B(x) = L(x), L logarithmic (from continuity 


and strict monotonicity), and f(x) = alogx +b. This results in CD-type function 
(17.7) or 
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Ff (x) = cc3(M(x) — 1) + c1 = 8) = c(M(x) — 1) +€2, 
h(x) = c3M(x) = a(x), 
k(x) = (cc3 — c2¢3)(M(x) — 1) + e1 — c2c3 = BQ); 
that is, 
fx) = ce(M(x) — 1) +1, 
a(x) = M(x), 
B(x) = (c —e1)(M(x) — UD), 


where M is multiplicative (M) (and from the continuity and monotonicity), and we 
have that f(x) = rx? +0, r, p € 0 results in 


1 

n = 

=x. 3 r 

Hr) = [3 wi + Ob ad to band 0 > 
i=1 


which is ACMS-type function (17.7a). The converse is easy to check. This proves 
the theorem. 


17.4 Business Mathematics—Interest 


17.4.1 Interest Formula 


See Castillo and Ruiz-Cobo [144] and Eichhorn [250], and let an amount K be de- 
posited for a period of length ¢t and F'(K, t) be the amount accrued by interest com- 
pounding. F(K, t) is subject to two conditions: 


(i) F does not change by splitting K into two parts, x + y, and, left for the same 
period, this yields 


Fa@t+y,)=FO,DO+F0,1), x,yeRy, teR}. 


That is, the final amount depends only on the total initial investment but not on 
the number of investments it can be divided into. 

(ii) The amount to which F(K, t,) during a time interval of length f2 is equal to the 
amount to which K increases during the period f; + f2 gives 


F(K,ti +) = F(F(K,t),%), K,t,t2 € Ri; 


that is, the final amount only depends on the total time, t; + f2, the capital is 
deposited, but not on the tentative period of deposit. 


For fixed rf, the first equation is an additive Cauchy equation giving (by Theo- 
rem 1.2) 
F(x,t)=c(t)x forx e€R,, 


17.4 Business Mathematics—Interest 681 
where c(t) > 0. This F in the second equation gives 
c(t) + f2) =c(t)c(t2) fort, to > 0. 


Since c(t) is a monotonically increasing function satisfying (E) by Corollary 1.36a, 
c(t) =e”, where d > 0. Thus F(x, t) = xq’, where g = e@ > 1. 
It is interesting to point out the following three important facts: 


Ga) If f(x, +t) < fLFG@, 1), fo], we get more if we cancel the actual deposit 
and initiate a new one by reinvestment. 
(i) If fwt+y,t) < f@,t)+ f(y, ¢), we are invited to make many small invest- 
ments instead of a single one for the whole amount. 
(iii) If the inequalities in (i) and (ji) are reversed, the bank is offering more than 
necessary. 


Thus, under this point of view, the optimal bank offer corresponds to the equali- 
ties of the expressions above. 


17.4.2 Simple Interest 


See [144]. Let F(k, t) be the accrued amount of capital k invested for a period of 
length t. Then the simple interest F'(k, t) is subject to the following two conditions: 


(i) As in (i), above 
Fa@ty,)N=FG,ON+FQ,1), x, ye Ry, te R}. 


(ii) The final amount depends only on the investment and the total time and not on 
the periods of time it can be divided into; that is, 


F(k, ti +t) = F(k, ti) + Fk, ft), »« € Ri, ti,t2 € Ri. 


As in the example above, (i) gives F(x,t) = c(t)x and this F in (ii) yields 
c(t) + f2) = c(t) + c(t2), which, again by Theorem 1.2, yields c(t) = bt and then 
F (x,t) = bxt (cf. the application in Chapter 1). 


17.4.3 Interest Rates 


See [144]. In the examples above, we characterized the compound and simple inter- 
est formulas from some properties that were written as functional equations. How- 
ever, some actual bank policies differ from those previously stated. As a matter of 
fact, the interest rate is usually dependent on the period and/or the amount of the 
deposit. This motivates the search for new interest formulas to be applied in the new 
situation of the actual economy. 

As in the second example, let F(x, t) be the future value of the capital x invested 
for a period of time of duration ft, and let h(x, t) = F(x,t)/x be the accumulated 
investment rate. These two properties suggest the functional equation 


682 17 Applications 
A(x, ty tt) =h, thx, tr), x,t ER, 


where we assume h(x, 0) = 1 and h is continuous and increasing in both arguments. 
This equation for fixed x is a Cauchy equation (E), so its general continuous solution 
is given by 


(17.9) h(x, t) = exp[C(x)t] = p(x)’ > F(x, t) = xp(xy', 


where p(x) is a nonnegative and increasing arbitrary function. Note that the cases 
of compound and continuous interest are particular cases with p(x) = 1 + x and 
p(x) = exp(x), respectively. Note also that the case F(x, t) = x[1 + r(x)]! (that is, 
with the interest rate being dependent on the initial investment) is included. 

Another possibility is to assume that the accumulated investment rate for an in- 
vestment x is the accumulated investment rate for a period of duration y raised to a 
power that is a function of y and x; that is, 


_ k(y,x) * 
h(x,t) =h(y,t) cies t,x,yeR,, 
with general (with the indicated conditions) solution 


h(x, t) = pO") = mony"; k(y,x) = LO = EMO) 
q(y) — logm(y) 


Hence, we get 
fx, t) = x(x)", 


which generalizes (17.9). 
Finally, we can assume that the accumulated investment rate for a period t) + 
can be obtained from the accumulated investment rates for periods f; and f2; that is, 


h(x, ty + t2) = U[h(x, t1), AC, t2)). 
Thus, its general solution is 
h(x,t) = wftp@)], UG,y) = wlw'(e) +0!) 
where w(x) and p(x) are arbitrary functions and w(x) is invertible. This leads to 
F(x, t) =xul[tp(x)], 


which also generalizes the interest formula (17.9). One important particular case is 
given by p(x) = x; that is, F(x, t) = xw(tx). The product measures the power of 
investment. 


17.5 Physics 


17.5.1 Quantum Physics 


D’ Alembert’s equation is one of the most important functional equations. This equa- 
tion was originally introduced as a tool for studying the vibrating string problem 
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(see Chapter 3 (VS)) and for axiomatic investigations of the parallelogram law for 
addition of vectors (see Chapter 1). It has been applied in non-Euclidean mechanics 
[44] and harmonic analysis (Hewitt and Zuckerman [370]). 

Investigations on the mathematical foundation of quantum mechanics have shown 
that a quantum mechanical influence function satisfies a generalized d’ Alembert 
functional equation. In his studies of discrete models for classical and quantum 
physics, Hemion [365, 366] introduced the concept of an influence f : R — R. The 
function f provides a measure of the influence between physical states (or config- 
urations). By applying the principle of strong causality (the future cannot influence 
the past), Hemion has argued that f must satisfy the condition 


(17.10) > fod =0> DIF@+ yi) + FO — yi] =0 


i=1 i=1 


forall x, yj € R. It is clear that (C) implies (17.10), so (17.10) provides an interesting 
generalization of (C). A physical interpretation states that if a present total influence 
vanishes, then it still vanishes when future effects are included. 

Closely related to (17.10) is the condition 


m n 


> fd) =>" 0) >= DU +x) + f@ — x0) 


al 7=! i=1 


(17.10a) = DUG +9) + F@— yA) 


j=l 


for all xj, yj € R. 

Conditions (17.10) and (17.10a) are equivalent if f is continuous and has a zero. 
If in addition f(0) = 1, then (C), (17.10), and (17.10a) are all equivalent (Gudder 
[330)]). 


Definition. Let V be a real topological linear space. A subset C is balanced if 
tC C C for all |t| < 1. A subset C is absorbing if for every x € V there exists an 
é > Osuch that tx € C for |t| < «. 


Result 17.3. [330]. Jf f : V — Ris continuous and satisfies (17.10a), then there ex- 
ists a continuous linear functional g : V — R such that either f(x) = f (0) cos g(x) 
or f(x) = f()cosh g(x) for all x € V. Further, g is unique in the sense that if 
f #O0and f(x) = f(O)cosk(x) or f(x) = f (0) coshk(x), for a continuous linear 
functional k, thenk = +g. 


Corollary 17.4. [330]. If f : V — R is continuous, satisfies (17.10), and has a 
zero, then there is a continuous linear functional g : V — R such that f(x) = 
Ff (0) cos g(x). 

Corollary 17.5. [330]. Let V be a real Hilbert space. If f : V > R is a continuous 
solution of (17.10a), then there is a unique y € V such that f(x) = f(O)cos(x, y) 
or f(x) = f()cosh(x, y) for all x € V (uniqueness means that if there is a z 
satisfying the representation above, then z = =y). 
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Corollary 17.6. [330]. Let V be a real Hilbert space. Suppose f : V > Risa 
continuous solution of (17.10) and has a zero. Then there exists a y € V such that 


f(x) = FO) cos(x, y). 


17.5.2 Gaussian Function 


In digital image analysis, edge detection, line detection, and texture classification 
are basic to image understanding and interpretation. Since a typical image is de- 
void of predetermined directions of edge, line, or texture, rotation-invariant filters 
are important for their detection. In designing such filters [342], the concept of a 
rotation-invariant separable function is often used. Also, in many problems in op- 
tics and quantum mechanics, we encounter such functions. For instance, two inde- 
pendent particles in quantum mechanics whose wave function is a product of the 
functions of individual coordinates are also a product when expressed in the centre 
of mass and relative coordinates. We show that every rotation-invariant, separable, 
real-valued function of two variables that satisfies the functional equation (17.18) is 
either Gaussian or identically zero. In Mann, Revzen, Khanna, and Takahashi [622], 
the authors study bifactorizable quantum wave functions. While proving that if two 
quantum systems are prepared independently and if their centre of mass is found to 
be in a pure state, then each of the component systems is also in a pure state, which 
in the coordinate representation is a Gaussian wave function, the authors [622] came 
across the functional equation (17.18). This result is established without assuming 
any regularity conditions on the unknown functions. Our approach is from the func- 
tional equation point of view and is simple, direct, and interesting. 

Let Q= {0 eR|OAF,n = 0,1,2,3,...}.A function f : R” — Ris said 
to be Gaussian if and only if 


C1711) f(1y 200.5 Xn) = keACTHIA HD), Oy, x9, 0.50) ER", 


where k is a real constant and A : R > R satisfies (A). A function G : R* > Ris 
said to be separable if there exist functions g, : R — R such that 


(17.12) G(x,y)=g(x)h(y) forx,yER. 
A function G : R? — R is said to be rotation invariant provided 


(17.12a) G(x, y) = G(x, yo) forall@ €Q, 


xo\ _ { cos@ sind) (x 
yo)  \-—sin@ cosO} \y}* 
If a function G : R* — R is rotation invariant and separable, then 


(17.13) g(x)h(y) = k(xo)€(vo), 


forx,y €R, 0 € Q, where g,h,k,€:R—-R. 


where 
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Main Theorem 17.7. (Kannappan and Sahoo [508]). Jf G : R? > R is a rotation- 
invariant separable function, then G is either identically zero or Gaussian. 


To establish this result, we need the following results. 


Lemma 17.8. [508]. Let f, g : R > R satisfy the functional equation 


(17.14) fat+y)=f@FO)sey) forx,y ER, 


where f and g are not identically zero functions. Then the general solution of 
(17.14) is given by 


1 
(17.15) FO) = ce2M@ItA2) g(x) = e442), forx ER, 

c 
where A,, Az: R > R satisfy (A) and c is an arbitrary nonzero real constant. 


Remark. First, the solution of (17.14) is obtained without assuming any regularity 
conditions whatsoever on f or g. Second, if we assume some regularity condition on 
f, say f is measurable or continuous or continuous at a point, then we obtain, as in 


Sahoo [721], f(x) = cer +h* and g(x) = tem, The measurability or continuity 
on f implies the same on g. 


Corollary 17.8a. The general nontrivial solution of 
(17.16) AG+y)= fAO)AO)s@y) forx,y ER, 
where fi, f2, 3,g : R > R, is given by 
fix) = abcete)+ 42) 
fox) = ~ fie) Bw) = fils) g(x) = Teh), forx ER, 


where Aj, A2 : R — R satisfy (A) and a, b, and c are arbitrary nonzero real 
constants. 


Corollary 17.8b. The general nontrivial solution of 
(17.17) fi —y)=AW&)AOIgey) forx,y ER, 
where fi, f2, 3,2: R= R, is given by 

fix) = abcetAie)+ A420) 


1 dl 1 ; 
AW=—Ne), A)=FfiCx), s&)= oe forx ER, 


where A,, Az: R > Rare additive anda, b, c are nonzero reals. 


Proof of the Main Theorem. (Kannappan and Sahoo [508]). If G is the zero 
function, it is obviously rotation invariant and separable. So, we assume G is not 
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identically zero. Suppose G(xo, yo) = 0. Then, by using separability and rotation 
invariance, we can show that G(x, y) = 0 for all x, y € R. The separability con- 
dition (17.12) gives g(xo)k(yo) = 0. Assume g(xo) = 0. Then again (17.12) gives 
G(xo, y) = 0 for all y. Given any (x1, y1), it is possible to find a suitable (xo, y) 
or a rotation of it, so that (x1, y1) is obtained as the image of (xo, y) by a suitable 
rotation. Hence, G(x;, yj) = 0, so G is nowhere zero. Then, from (17.13), we 
obtain 


(17.18) k(ux + vy)€(—vx + wy) = g(x)h(y),_ forallx,y ER, 


for all (u,v) not equal to (0,1), (1,0), (0,—1), (—1,0), where w = cos@ and 
v = sin@. Since G is nowhere zero, we conclude that k, €, g, and h are also 
nowhere zero. Letting x = 0 in (17.18), we get 


(17.19) h(y) = ak(vy)e€(uy), 

where a is a nonzero constant. Similarly, letting y = 0 in (17.18), we get 

(17.19a) g(x) = bk(ux)€(—vx), 

where b is a nonzero constant. Using (17.19) and (17.19a) in (17.18), we obtain 
k(ux + vy)€(—vx + py) = abk(ux)e(—vx)k(vy)e(uy), 


which is 
K(ux + vy) _ ab EY) 
k(ux)k(vy) €(—vx + wy) 


Replacing x by qx and y by 1 y in the equation above, we get 


(17.20) kv y) _ (#8) O69) 
‘ ————— = abh-———_.. 
k(x)k 
Letting | = x any nonzero real (recall a = cot, and it is possible to choose fc =x 


any nonzero real) in (17.20), we see that the right side of (17.20) is a function of xy. 
Hence we have 


(17.14) k(x + y) = k(x)k(y)o (xy) 


for all x, y € R with x 4 0. By Lemma 17.8 and the remark following it, we obtain 


(17.15) k(x) = cet +406), g(x) = LAr), 
G 
Letting (u,v) = (+. +) in (17.20), we get 


k(xt+y)  _, &(-x)€Q) 


we ERO) —VAa yy 
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Using (17.15) in (17.16), we see that 
€(—x + y) = abc(—x)€(y)e7 41), 


Thus, by Corollary 17.8b (which could use Corollary 17.84), we have 
(17.21) E(x) = —e241@ )+A3 (x) 
abc 


Using (17.15), (17.21), (17.12), (17.12a), and (17.18), we obtain 


G(x, y) = k(ux + vy)e(—vx + my) 
(17.22) 


=) gave? ty +42 (uxtoy) taser tay), 
ab 


Since G is rotation invariant for all 9 € Q, G in (17.22) remains the same for 


different choices of (u,v). By choosing (u,v) = (=. 43) , we get 


= 1 eo (Yate? 49%) 4 do (242) 4g (DEE 
(17.23) Glx.y) = exp (FAi6 +97) + aa (2) + a9 eG )). 


Similarly, choosing (u,v) = ( +3) , we see that (17.22) becomes 


1 
er, 


(1 ) Gry) 1 1 ( 2 2) —x-y x-—y 

: a = — — —— . 
7.23 G(x, y)= exp Ai(x° + y~) + A2 Va + A3 V2 
Hence, from (17.23) and (17.23a), we obtain 


for all x, y € R. Now, from (17.23), we have 


1 
G(x, y)= eBay"), 


This completes the proof of the theorem. 


We conclude this result with the following remark. A function G : R* > R is 
circularly symmetric if G(x, y) = f(/x?2 + y2) for some real-valued function f. 
It was shown in [721] that circularly symmetric separable functions are Gaussian. 
Circularly symmetric functions are precisely rotation invariant, and not the converse. 
In harmonic analysis, such functions are generally called radial functions [369]. Thus 
this result can also be deduced from [721]. However, here we accomplished our task 
by solving a quite different and interesting functional equation (17.14) and the related 
ones considered by others in different contexts [622]. Furthermore, this approach 
allows us to study linear transformation-invariant separable functions. 
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17.5.3 Chebyshev Polynomials 
17.5.3.1 Introduction 


We now discuss yet another example of the application of the cosine equation. 
In 1750, d’Alembert [189] introduced the cosine functional equation (C), which 
arose in modelling the motion of a stretched string [808] and in the foundations 
of mechanics. 

To see how one can “find” the cosine in (C), assume that the not identically zero 
solution f is twice continuously differentiable. Then, from 


fa ty)t+ fe —y)—2F() _ fQ) + fy) — 2f 0) 
eee ee Seer 
y y 
using f(—y) = f(y) and f (0) = 1, which follow from (C), and taking the limit as 
y tends to 0, one obtains 
f(x) = FO) FO). 
Hence 
cos(cx) if f”(0) < 0, 
fa)= | ie ooh 
cosh(cx) if f”(0) > 0, 


where c := ,/|f”(0)| (see Theorem 3.9, Chapter 3). 

It is worth remarking that d’Alembert was among those calling for a theory of 
limits that would justify the argument just given. It is also worth remarking that the 
technique of reducing a functional equation (such as (C)) to a differential equation 
has been a mainstay for the past 250 years (see [44]). 

Indeed, in Kannappan [424, Theorem 3.21], (C) was solved in great generality, 
in particular where x, y are elements of an additive Abelian group and f(x) is a 
complex number without assuming any regularity (e.g., continuity) in the function. 
In the theorem it is proved that, given a solution of d’ Alembert’s functional equation 
with f(0) = 1, there is a function e : dom(f) — C such that e(0) = 1 and 
e(x+y) = e(x)e(y) and 2 f (x) = e(x) + e(—x) for all x € dom(f). In the classical 
cases, e(x) = e! for cos(cx) and e(x) = e™ for cosh(cx). 

Now d’Alembert’s equation is solved when the domain of f is the additive group 
of the integers and the codomain of f is a commutative ring F’. It is here that, perhaps 
surprisingly, the Chebyshev polynomials show up. 


Definition. Chebyshev polynomial T,, € Z[X] is given by, form € Z7,, 


q 
(17.24) Tn (X) = > (jete — 1), 


k=0 


where g is the largest integer with 2g < m (see [197]). 
If p € Z[X], say 


p(X) = pot piX +--+ + paX4, 


17.5 Physics 689 
and ifr € F, then, as usual, 
p(r) := pot pir +--+ par’. 


The general reference for Chebyshev polynomials (Tchebycheff—hence T) is Rivlin 
[704]. The occurrence of T,, here is really as a polynomial, not a polynomial function 
as in Rivlin generally. 


Result 17.9. [215]. Let f : Z— F with f (0) = 1. Then 
(17.25) f(m+n)+ f(m—n)=2f(m)f(a), form,n€ Z, 


if, and only if, 
f(a) = Tini (Ff C)), neZ,. 


The identically zero function satisfies (17.25) but is not of the form T(,)(f (1); 
this is why f (0) = | is a constant assumption. 


17.5.3.2 Reduction to a Difference Equation 


The result below shows that the two variables m, n in (17.25) can, over Z, be re- 
placed by a single variable equation. 


Proposition 17.10. (Davison [211]). Let f : Z => F with f(0) = 1. Then f 
satisfies equation (17.25) if, and only if, 


(17.26) fm4+2)+fam)=2fU)f4+1), neZ. 
Proof. Assume f satisfies equation (17.25). Withm =n+1, n = 1, we get 
fmtl4+)4+fam4+l-)D)=2f4+fd); 


this is equation (17.26). 
Assume conversely that f satisfies equation (17.26). Then f is even; that is, 


fen) = f(r), neZ. 


It clearly suffices to prove evenness for all n € Z4. This is proved by induction on 
n € Z4. Itis trivially true form = 0. Also f(—1) = fC) since f (0) = 1. Assume it 
is true for alln € Z,, withO <n < N, where N > 1. Thenitis true forn = N +1; 
using (17.26), 


fN+1) + f(N —- 1) =2fQ) FW). 
Since f(N — 1) = f(—(N — 1)) and f(N) = f(—N), by the induction hypothesis 


(17.27) f(N+1 + fA —N) =2f()F(CN). 


690 17 Applications 


But again using equation (17.26) withn = —N — 1, 
(17.27a) fCN+0D4+ f(-N -1) =2f0)f(-N). 


Comparing the two equations above yields f(—N — 1) = f(N + 1); that is, f is 
even for all n € Z by induction. 

To show that f satisfies equation (17.25), the function g : Z* — F defined next 
must be identically zero: 


gk, &):= fk+O+fK-O-2fHOfO, &OH€Z. 
Now, since f, assumed to satisfy equation (17.26), has been shown to be even, 
sk. \=seb=s-kO, kN eZ. 


So iterating the involutions (k, £) > (€,k) and (k, £) — (—k, €), it follows that F 
is identically zero on Z? if, and only if, g is zero on 


Y:={(k,O¢€Z:0<€<k}. 


Using equation (17.26) to express f (K+ €) in terms of f(k+¢—1) and f(k+€—2) 
and similarly f(k — €) in terms of f(k — € — 1) and f(k — € — 2), it follows that 


(17.28) gtk, €) =2f0)F—-1,0)—-ek-2,0), (kd €Z?. 


The “size” of (k, €) € Y is k + €—the taxicab distance from (0, 0) to (k, €) in Y. By 
induction on the “size” of (k, £) € Y, it is easy to show, using equation (17.28), that 
g is zero on Y. (g(0,0) = O is true since f(0) = 1, as is g(1,0) = 0. g(,1) = 
f(2)+ fO)—2f(’) fC) = 0 since f satisfies (17.26).) This completes the proof 
that (17.26) implies (17.25). 


Corollary 17.11. Let f :Z— F with f (0) = 1. Then f satisfies equation (17.26) 


if, and only if, it is even and 


fn+2)+ f(n)=2f0)f@t), neZy. 


Proof. Assume f satisfies (17.26). Then, as above, f must be even. Clearly, f 
satisfies (17.28), as the domain of equation (17.26) includes the domain of equation 
(17.28). So this direction is proved. 

Assume, conversely, that f satisfies (17.28) and is even. Then 


f-)D+f/MO=fO+FM FCED= fd) 
=2f()f0) (fO)=D. 
So f satisfies equation (17.28) form = —1. Now letn € Z withn < —2. Then 
fa+2)+ fF) = fa) + fmt 2) 
= Jen at = 2). (== 2, =1-eZ) 
= 2f(I)f(—n - 1) (n—1eZ) 
=2f(I)f(n+ 1). 
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So f satisfies equation (17.28) for all integers n < —1. Thus 
fan+QY+fa)=2fOfat+), neZ, 


as claimed. 


Equation (17.26) is a linear difference equation of the second order: Since 
f() = 1 if f() is given, then f(2), and recursively f(3), f(4),... are deter- 
mined. 


17.5.3.3 The Universal Solution 


Definition. Let T : Z — Z[X] be given by T(0) = 1, TC) = X, T(—n) = 
T(n), n € Zz, and 


(17.29) T(n+2)+T(n)=2XT(n+1), ne Zq. 


It is customary to write T(n) as Ty(X). Thus 7)(X) = 2X? — 1, 73(X) = 4X3 — 
3X, T4(X) = 8X4 — 8X? + 1 follow immediately from equation (17.29). 

By Proposition 17.10, T : Z — Z(X) is a solution of d’ Alembert’s functional 
equation (17.25). Indeed more is true! 


Proposition 17.12. (Davison [213]). Let f : Z — R with f (0) = 1. If f satisfies 
equation (17.25), then 


(17.30) fa) =Ti(fQ)), ne Z, 
where n +> T,(X) is the family of polynomials from the definition above. 


Proof. Since both f and T are even (one by virtue of satisfying equation (17.25), the 
other by definition), it suffices to prove (17.30) for alln € Z,. Now f(0) = 1 = 
To(f C)), and f(1) = 7; (f (1)). So assume it has been shown that f(n) = T,(f (1)) 
for alln € Z4 within < N, where N € Z+ and N > 1. Then 


f(N+1) =2fCQ)fW) -— f(V - 1) (f satisfies (17.25)) 
= 2T\(f ))Ty(fC)) — Ty-1(f(1)) (induction hypothesis) 
= Ty+i(f()) (T satisfies (17.29)). 


Thus the result is true forn < N +1, so the result follows for all n € Z+. 
Note that what makes the preceding proof work is that, for each r € F, the 
evaluation ev; of p € Z(X) at r is ahomomorphism from Z(X) to F: 


ev;(p + gq) = ev; (p)+ eo-(q) p+") = pr) +¢)], 
ev, (p-q) = ev;-(p) +ev-(q)  [(pa)(r) = p(r)q(r)I. 
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So Proposition 17.12 can be restated in diagram form as 


E 


Given f satisfying equation (17.25), there is a unique homomorphism (of com- 
mutative rings) ev ¢(1) such that f = ev¢(1) o T. Also ev f(1) is completely specified 
by 

evfayd)=1 and eve (X) = fl): 


There is one, and only one, ring homomorphism that sends | (of Z) to 1 (of F’) and 
X of Z[X] tor € F. 


17.5.3.4 Identification of the Universal Solution 
The difference equation for T is 
(17.31) Tin+2)-2XT(v1+1)+T()=0, neZ,. 
The (quadratic) indicial equation for this is 
2? —2XA41=0. 


This has roots 


Ay=X4VX2-1, A =X-VX*- 1. 


These roots lie in the quadratic extension K of Z[X], where 


0 I) _[x?=+1 0 
Cer =1) 0] = OG. X=? 
x of xX =i 
ome es iy! 


(Note that the characteristic polynomial of 2, is t? — 2Xt + 1.) 
Hence, for some a; and a2 € K, 


so that, since 


2T(n)=ajAji +a2d5, neZy. 
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From the initial conditions, T (0) = 1, and T(1) = X, soa; = 1 anda2 = 1. 
Thus 


n 


27,(X) = >° (je [x2 = Di + Enix? = 1] 


j=0 


q 
— n Pama 0 Ge _ ‘yg 
2k , 


k=0 


where j = 2k since for j odd 1 + (—1)/ = 0. Hence the following result has been 
proved. 


Proposition 17.13. [211]. Suppose T : Z — Z[X] is given by (17.29). Then 
T(n) (= Ty(X)) is given by (17.24). In other words, the univeral solution to the 
d’Alembert equation over Z is given by the family of Chebyshev polynomials. 


Since A2 = i, 
x iff x = 
Paix = AF ey? 


and the solution is seen to agree with Kannappan’s general description. Define 
E(n) := A). Then E(m +n) = E(m)E(n), and E(0) = 1 and 


2T (n) = E(n) + E(-n). 


17.5.3.5 General Remarks 


One direction of the theorem says: If f : Z — F satisfies d’Alembert’s equation, 
then f(n) = T(n)(f(1)) for all integers n. This has been proved. Proposition 17.12 
gives this for the universal T, but Proposition 17.13 identifies the universal T as the 
Chebyshev family. 

The other direction of the theorem is just as easy now: The Chebyshev family is 
given by (E(n)+£(—n))/2 and so satisfies d’ Alembert’s equation, and consequently 
so does any homomorphic image via evaluation maps T,,(X) — T,(f (1)). Thus the 
theorem has been proved. 

Finally, the well-known definition of the Chebyshev polynomial (see Rivlin [704, 
equation (1.2)]) is a consequence of the theorem: Define for 9 € R the function 
f:Z— RbynPb cos(n@). Then f satisfies d’ Alembert’s functional equation, as 
was noted previously. Hence, by the theorem, 


(17.32) cos(n0) = f(n) = Tn(f (1) = Tn(cos 8). 
In a similar way, it can be shown that 


(17.32a) cosh(nt) = T,(cosht), néEZ,teER. 
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These equations can be subsumed under the general result 


xXn4+x-" X+x=! 
(17.33) a = n(=2), neZ, 


where the theorem has been applied to the function n t~ a = OF X—"1. 
So equation (17.32) follows from (17.33) by evaluating X at ef? as does (17.32a) 
with evaluation at e’. 


17.6 Topology 


Topology can be introduced axiomatically in many ways through open sets, closed 
sets, the interior operator, the closure operator, etc. Here it is done through the 
interior operator (see Kannappan [425]). For characterization by a single functional 
equation through the closure operator, see Monteiro [638], Aczél [10], and Lin [608]. 

It is well known that a set X together with a function i (called the “interior” 
operator) on subsets of X satisfying the following conditions can be endowed with a 
topology (Kannappan [425], Vaidyanathaswamy [808)]): 


(K1) i(C)CC forC CX: 

(K2) i(CNH)=i(C)Ni(H), C,HCX; 
(K3) i(X) =X; 

(K4) iG(C))=i(C), CCX. 


Evidently, (X, M) is an Abelian semigroup. Also, the axioms (K1) to (K4) are equiv- 
alent to 


(17.34) i(xX) = X, 
(17.35) i(CNH)=CNANiZ7(C)Ni*(A), forC,H CX, 
where i7(C) = i[i(C)]. 
Here we consider more generally the problem of defining a natural topology on 
an arbitrary Abelian semigroup and prove that, for an arbitrary Abelian semigroup G 


(written multiplicatively) with the identity element e, the conditions corresponding 
to (17.34) and (17.35), 


(17.34a) f)=e 
and 
(17.35a) fey =x fP@Fr7O), xy EGP?) = fF) 


(where f is a function from G into G), with further simple assumptions on the 
operation - in G, imply the analogues of (K2), (K4), and (K1) ((K3) corresponds 
to (17.34a)), 
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(17.36) fay=f@)FQO), x.y EG, 
(17.37) Foaly), 

and 

(17.38) f@) <x 


in a reasonably chosen ordering < . 

Also, further analogues of known properties of the interior will be derived from 
(17.34a) and (17.35a), and it will also be examined how these modify if the above- 
mentioned further assumption about the operation - in G is not made. Here the 
interior operator is characterized by a single functional equation. A counterexample 
is provided at the end to show that the same is not true in every Abelian semigroup 
with unit element without any further assumptions on (-). 


Result 17.14. (Kannappan [425]). Let G be an arbitrary Abelian semigroup (mul- 
tiplicative) with identity element e. Then, for a function f : G — G satisfying 
(17.34a), the two conditions (17.35a) and (17.36) with 


(17.39) fx) =xf?(x), x eG, 


are equivalent. 


Proof. Let f satisfy (17.35a). Putting y = e and using (17.34a), we get f(x) = 
xf), which is (17.39). From (17.35a), (17.39), and the Abelian property of G, 
we get (17.36). Conversely, from (17.36) and (17.39), (17.35a) is immediate. This 
completes the proof of this result. 


Remark. Equation (17.36) is a familiar property of the interior (K2). Naturally, one 
would expect (K4) 


(17.37) f(x) = f?(x) 


instead of (17.39). But a counterexample is provided at the end to illustrate that 
(17.34a) and (17.35a) do not imply (17.37) in every Abelian semigroup. But, as 
mentioned earlier, with the further assumption x - x = x (idempotence) in G for 
every x € G, (17.37) is also true. As a matter of fact, we prove the following result. 


Result 17.15. [425]. Let G be a semigroup (Abelian or not) such that x - x = x for 
every x € G. Then, if f : G > G satisfies (17.39), f also satisfies (17.37). 


Proof. From (17.39) and x - x = x, for every x € G, it follows that 
(17.40) f(x) =xf?(x) =x-xf?(x) =xf (x) forallx €G. 
Replacing x by f(x) in (17.40) and using (17.39), we obtain 

PoeaH=foroa=r Ora @=sh). 
which is (17.37). This completes the proof of this result. 
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In order to obtain the analogues (17.38) of (K1) and further inclusion, the relation 
< is defined in G as follows. 


Definition. For x, y € G, define x > y if, and only if, there exists a z € G such that 
xXZ=y. 


Remark. This relation > in G is reflexive and transitive, so it is a quasiordering. 

In fact, for every x € G, xe = x. So, by the definition, x > x. Let x > y and 
y > z. Then there are u,v € G such that xu = y and yo = z. Hence xuv = yo =z. 
Thus x > z. 


Theorem 17.16. [425]. Let G be an Abelian semigroup with the quasiordering 
defined by the definition above. If f : G > G satisfies (17.34a) and (17.35a), then 
f also satisfies 


(17.41) x>y implies f(x)>fy), x,y EG, 
(17.41a) x> f(x), «EG, 

(17.41b) F@)= F@), 

and 

(17.41c) fa) = f?Q). 


Proof. Let f satisfy (17.34a) and (17.35a). Let x > y. Then there exists az € G such 
that xz = y. Since f satisfies (17.35a), by Result 17.14 we have f(y) = f(xz) = 
f (x) f(z), so, by the definition, f(x) => f(y). Since f satisfies (17.35a), f also 
satisfies (17.39), from which there results that x > f(x) and f?(x) > f(x). From 
(17.41) and (17.38) follows (17.41c). Thus the proof of this result is complete. 


Remark. Unless the quasiordering > in G is a partial ordering (that is, x > y and 
y > x imply x = y), (17.41b) and (17.41c) in general do not imply (17.38), as is 
shown in the following example. 


Counterexample. Let G be a group (Abelian) such that x? = e for every x € G. 
For example, let G be {e, a, b, c} and the multiplication defined by 


elabc 


alecb 
blceea’ 
clbae 


Let f : G > G be defined such that 
(17.42) fle)=e, fa=b, flb)=c, and f(c)=a. 


Then, evidently, f(xy) = f(x) f() for all x, y € G, so (17.36) is satisfied. From 
the definition of f, we have 


Pre PoO=a PO=a, ad ~O@=b. 
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Evidently, f7(x) ¢ f(x) for x € G\{e}. Hence (17.37) is false. Further, 
(17.43) ef-(e)=e, af*(a)=b, bf*(b)=c, and cf(c) =a. 


From (17.42) and (17.43), we get x - f?(@) = f(x) for all x € G. Hence (17.37) is 
true. Thus, in general, (17.34a), (17.36), and (17.39) (or (17.34a) and (17.35a)) do 
not imply (17.37). 


Remark. (Lin [608]). The commutativity requirement can be removed when the 
right side of (17.35a) is suitably rearranged: 


(17.35b) f@y) = f?@)xyf?). 
If f : G > G satisfies (17.34a) and (17.35b), then 
(i) (17.36) and 
(17.39a) fo=a OS as 
hold. (The converse also holds.) 


(ii) f7(x) = f(x) & Ff (x) is idempotent for every x. 


17.6.1 Integral 
17.6.1.1 Simpson’s Rule 


The general solution of the functional equation 


(17.44) f(x) — g(y) = @ — y)[h(sx + ty) + o(%) + wO)] 


for all x, y € R with s and ¢ being a priori known parameters is determined without 
any regularity assumptions (differentiability, continuity, measurability, etc.) imposed 
on the real functions f, g, h, $, and y. The motivation for studying this equation 
came from Simpson’s Rule for evaluating definite integrals (Kannappan, Riedel, and 
Sahoo [500]). 

In connection with Simpson’s rule for evaluating definite integrals, one comes 
across the functional equation 


74s) fe) £0) = SS] e000 +4 (SZ) + co], 


where /f is an antiderivative of g. This equation is a special case of 


(17.44a) f@)-gQ) =@—-ylha@+y)+yv@)+¢O)], 


where f,g,h, y,¢ : R — R are unknown functions. First, we treat equation 
(17.44a) (see [583]). The middle term of equation (17.45), 4g (=), is due to 
the fact that in Simpon’s Rule one partitions the interval into subintervals of equal 


lengths. However, there is no reason why one should be restricted to such an equal 
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partition. If we allow unequal partitions, then the middle term is no longer of the 


form 4g (=) but rather is of the form ag(sx + ty), where a, s, and f are constants. 


Taking this into account, we have a generalization of (17.44a) to (17.44b), 


(17.44b) F(x) — Ff) — & — y)[hGx + ty) + g(x) + gy), 


for all x, y € R, with s and ¢ being a priori chosen parameters. 

Our objective now is to determine the general solution of the functional equation 
(17.44b) without any regularity assumptions (differentiability, continuity, measura- 
bility, etc.) imposed on the unknown functions. Further, utilizing the solution of this 
equation, we determine the general solution of the functional equation (17.44). 

Before obtaining the solution of (17.44), we give the motivation—Simpson’s 
Rule (Kannappan and Sahoo [514]). 

For a function f : R — R differentiable on R, the mean value theorem states 
that 

f@)-~FfO) _ 
x-y 7 


f'(n), 


which holds for 7 between x and y (see Chapter 7). Obviously, the mean value 7 
depends on x and y, and one may ask for what f the mean value 7 depends on x and 
y in a given manner. For more information, see Aczél [31], Aczél and Kuczma [54], 
and Chapter 7. 

Simpson’s Rule is an elementary numerical method for evaluating a definite in- 
tegral |, : f (t)dt. The method consists of partitioning the interval [a, b] into subin- 
tervals of equal lengths and then interpolating the graph of f over each subinterval 
with a quadratic function. If a = x9 < x1 < x2 <-+++ < X2, = bisa partition of 
[a, b] into 2n subintervals, each of length Pa then 


b b— 
‘) f(t)dt = Fao) +4 f (x1) +--+ +2f on—2) + 4f an-1) + f On]. 


This approximation formula is called Simpson’s Rule. It is well known that the error 
bound for the Simpson’s Rule approximation is 


2 b-a 
/ f (dt Se Te Ora Ga) 


k(b—a) 


+ 2 f (x2n—2) — 4 f (X2n-1) = Ff (x2n)] S ~ 180n4 ” 


where k = sup{| f“(x)| | x € [a, b]}. It is easy to note from this inequality that if 
f is four times continuously differentiable and f(x) = 0, then 


b b— 
i f(t)dt = If Go) +4 f (x1) +-+-+2f on—2) + 46 Gon-1) + f 2n)]- 
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This is obviously true if n = 1, and it reduces to 


[ f (dt = —“[f (x0) +4 f 1) + f@a)]. 
Lettinga =x, b=y,andx,; = > in the formula above, we obtain 
(SR) a f(t)dt = =| reo+ar (Se *) + Io a 


This integral equation (SR) holds for all x, y € Rif f is a polynomial of degree 
at most three. However, it is not obvious that if (SR) holds for all x, y € R, then 
the only solution f is the cubic polynomial. The integral equation (SR) leads to the 
functional equation 


(17.45) g(y)— g(x) = | Fw +4 ( *) + fo), 


where g is an antiderivative of f. The equation above is a special case of the func- 
tional equation 


(17.44a) f(x)— gy) = @ — y)[ha& + y)+ y@)+¢40)] 


for all x, y € R. Now our goal is to determine the general solution of the functional 
equation (17.44a). The two functional equations 


(17.46) g@)-s@)=@—-y)f@t+y)+@+y)f@—y), forallx,y ER, 


and 


(17.47) xf(y) — yf x) = @& — y)lg@ + y)— g@%)—gQ)],  forallx,y eR, 


are instrumental in solving the functional equation (17.44a). 


17.6.1.2 Solution of the Functional Equation (17.46) 


Result 17.17. The functions f, g : R — R satisfy the functional equation (17.46) if 
and only if 


(17.46a) f(x) =ax*+ A(x), g(x) = 2ax4+2xA(x) +), 


where A is additive and a and b are arbitrary constants. 


17.6.1.3 Solution of the Functional Equation (17.47) 


The general solution of the functional equation (17.47) without any regularity as- 
sumption on the unknown functions is given by the following. 
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Result 17.18. The functions f, g : R > R satisfy the functional equation (17.47) if 
and only if 


(17.47a) f(x) = 3ax°+2bx*+ex+d, g(x) = —ax? — bx? — A(x) —d, 


where A is additive and a, b, c, and d are arbitrary constants. 


Using Result 17.18, we solve the functional equation 


(17.48) f@)- gO) = —-y)A@t+y)+k@)+kQ)), x,y ER. 


Theorem 17.19. (Kannappan [475]). The functions f,g,h,k : R — R satisfy 
(17.48) if and only if 


fF) = 3ax* + 2bx? + cx? +. dx +a = g(x), 
(17.48a) h(x) = ax3 + bx? + A(x) +d —- 28, 
k(x) = 2ax? + bx? +. cx — A(x) + B, 


where A is additive anda, b, c, d, a, and f are arbitrary constants. 


Proof. Letting x = y in (17.48), we see that f(x) = g(x). Hence (17.48) reduces to 
(17.49) ff) — FO) = & — yhA@ + y) +k) + ky). 
Putting y = 0 in (17.49), we obtain 
f(x) = FO) + x[A(x) + kx) + k(O)]. 
Putting this equation into (17.49) and rearranging, we obtain 
(17.50) y[h@e) +k(@x)]—x[A(y) +k(y)] = @& — yA +y) -A(xe) —h(y) — KO). 
Defining 
(17.51) d(x) := A(x) +k) and g(x) :=—h(x) —4(0), 
and using (17.51) and (17.50), we have 
(17.47) xP(y) — yO(x) = & — y)[g@ + y) — 8%) — 8)] 


for all x, y € R. The general solution of (17.47) can be obtained from Result 17.18 
as 


(17.5la) b(x) = 3ax2+4+2bx*+cx+dy and g(x) = —ax?—bx?— A(x) —do, 


where a, b, c, and do are constants. From (17.51) and (17.51a), we obtain 


(17.51b) K(x) = (x) + g@) + B, 
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where £ = k(0). Now, using (17.51a) and (17.51b), we obtain 
(17.51c) k(x) = 2ax* + bx* +.ex — A(x) + B. 
Again, from (17.51) and (17.51a), we have 
h(x) = ax? + bx* + A(x) +d — 28, 
where d = do + f. Thus we get 


f (x) = 3ax4 + 2bx3 +. cx? +.dx +a, 


where a = f (0). The proof of the result is now complete. 


Remark. It follows easily from Theorem 17.19 that the solution of (17.45) is g(x) = 
3ax* + 2bx? + cx? +.dx +a and f(x) = 12ax3 + 6bx? + 2cx + d, as predicted. 
Note that the general solution is obtained without any regularity assumptions of f 
and g. 


Now we prove our main result, the solution of (17.44a). 
Theorem 17.20. (Kannappan [475]). The functions f, g,h,¢, y : R > R satisfy 
the functional equation 
(17.44a) fx) — 80) = @ — yh + y) + O@) + YO)I 
forall x, y € Rif and only if 


f (&) = 3ax* + 2bx? + cx? +dx +a = g(x), 
h(x) = ax? + bx* + A(x) +d —- 28, 

(x) = 2ax? + bx? +x — A(x) + B+, 
y(y) = 2ay? + by* +cy -AQ) + B-», 


(17.44c) 


where A is an additive function anda, b, c, d, a, fh, and y are arbitrary constants. 


Proof. First, letting x = y in (17.44a), we see that f = g. Interchanging x with y in 
(17.44a) and using the fact that f = g, we get 


(17.52) fQ)— f@) = —x)A@ + y)+ 40) + w@)]. 


Adding (17.52) to (17.44a) and using f = g, we get w(x) — d(x) = wiv) — dQ) 
for all x, y € R. Thus 
w(x) = p(x) —2y, 


where y is an arbitrary constant. Putting this ¢ into (17.44a), we have 
(17.48) f(x) — gy) = & — yh + y) + O() + 60) — 271. 


Using Theorem 17.19, we have the asserted solution (17.44c). This completes the 
proof. 
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Now we proceed to find the general solution of (17.44b) with no regularity as- 
sumption imposed on f, g, and h. The solution of this equation will be used to 
determine the general solution of the main functional equation (17.44) of this sec- 


tion. 


Result 17.21. [445]. Let s and t be real parameters. Functions f, g,h :R— R 


satisfy the functional equation (17.44b) for all x, y € R if and only if 


ff) = 


g(x) = 


h(x) = 


where A : R —> R is an additive function and a, b, c, d, a, and B are arbitrary 


real constants. 


The following result is obvious from Result 17.21, and this result was established 


ax*+(b+d)x+c 
ax*+(b+d)x+c 
ax*+(b+d)x+c 


3ax4 + 2bx3 + cx* + d+2h)x +a 
Dax? +-ex* 4 264 = AW) +a 


ax*+(b+d)x+c 
ax + § 
ax + 
ax +4 


2ax3 + bx? +cx — A(x) + B 


3ax* +ex+ 
ax +8 


arbitrary with h(O) = d 


ifs =O0=t 

ifs =0, t 40 
ifs £0, t=0 
ifs =t #0 

ifs =—-t #0 
SOAs? #1 £0, 
ifs =O0=t 

ifs =0, t £0 
ifs £0, t=0 
ifs =t 40 

ifs =-t £0 
SOAs £1? £0, 
ifs =O=t 

ifs =0, t #0 

ifs £0, t=0 

ifs =t #0 

ifs =-t #0 
f0A2 £2? £0, 


in [519] to answer a problem posed by Rudin [716] (see also Chapter 7). 


Corollary. Let s and t be real parameters. Functions f, g,h : R — R satisfy the 


functional equation 


f(x) — gy) = & — y)A(sx + ty) 


for allx, y € Rif and only if g(x) = f («) and 
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dx +c ifs =O0=t 

dx +c ifs =0, t #0 

dx +c ifs £0, t=0 
f@)= 2 ? 

cx° +dx+a ifs =t #0 

a — A(x) ifs =-t £0 

dx +c ifO As? #t? £0, 

arbitrary withh(0)=d ifs =0=t 

d ifs =0, t £0 

ifs £0, t=0 

Ax)=4 ea 

“+d ifs =t £0 


-4A(*), x40 ifs =-t £0 
d if0O As? £1? £0, 


703 


where A: R — R is an additive function and c, d, and a are arbitrary constants. 


17.6.1.4 Solution of the Main Functional Equation (17.44) 


Now we proceed to determine the general solution of (17.44). 


Theorem 17.22. [475]. Let s and t be real parameters. Functions f,g,h,¢, w : 
R => R satisfy the functional equation (17.44) for all x, y € Rif and only if g(x) = 


Ff (x) and 


f(x) = 


p(x) = 


ax? + (b+d)x +c 

ax? + bx +e 

ax? + bx +c¢ 

3ax* + 2bx3 + cx* + (d+2f)x+a 
2ax3 + cx? + (28 — d)x — A(x) +a 
ax*+(b+B+d)x+c 


ax + 2 
2ax? + bx? + cx —A(x) + B+ 4 
3ax* + ex — 5Ao(x) + B 
ax-A(#4) 4 p45 


ifs =O0=t 

ifs =0,t #0 
ifs £0, t=0 
ifs =t #0 

ifs =—-t #0 

f0 As? #7 £0, 
ifs =O=t 

ifs =0, t 40 
ifs £0, t=0 
ifs =t #0 

ifs =-t #0 
fOA2 AP £0, 
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ax + G40) ifs=O=t 

ax + © _ pngx) ifs =0, t £0 

ax + &9) _ n(sx) ifs #0,t=0 
VO) = Voax3 + bx2 + cx — AQ) + 8-8 ifs =1 #0 

3ax? + cx + ZAo(x) + B-d ifs =—t £0 

ax— (rs) +4 POA #P £0, 

arbitrary with h(QO) = d ifs =O=t 

arbitrary ifs =0, t #0 
arbitrary ifs £0, t=0 
= Vals) +b (EV +A (8) +4 hale 

re) , ; 
-a (8)? —£4 (4) +4400), x #0, ifs=-1 #0 
A(ts) +d f0 A s* At” £0, 


where Ap, A : R — R are additive functions and a, b, c, d, a, f, and 6 are 
arbitrary real constants. 


Proof. Letting x = y in (17.44), we see that 
f (x) = g@) 
for all x € R. Hence (17.44) becomes 
(17.53) f(x) — Ff) = @ — y)[h(sx + ty) + os) + vO)I. 


Interchanging x and y in the equation above and adding the resulting equation to it, 
we have 


(17.54) h(sx + ty) + d(x) + wy) = h(sy + tx) + b(y) + vw) 


for all x, y € R with x # y. But (17.54) holds even for x = y also. 

Now we have to consider several cases: s = 0 = t; 5 = 0, t £0; ands # 
0, +40. 

The case s = 0 = ¢ yields (x) = w(x) = 6 and 


f(x) — Ff) = @ — y)AGx + ty) + w@) + w(y) - 4). 
Hence, by Result 17.21, we obtain the asserted solution 


f@)=ax?+(b+dxte, ga) =f), 
oe) sart 25", yo) sax $28, 
h(x) arbitrary with h(0) = d, 


where a, b, c, d, and 0 are arbitrary constants. 
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Consider the case where s = 0 andt # 0. (The case where s ~ 0 and t = O can 
be handled in a similar manner.) Then, for this case, from (17.54), we have 


(ty) + wy) — 6) = hex) + v@) -— 6), 
for all x, y € R, and using Result 17.17, we have the asserted solution 


f(x)=ax?+bxte, g(x) = f(x), 
oe) sart2F 9 yey sat 25" — nen, 


h(x) arbitrary, 


where a, b, c, d, and 0 are arbitrary constants. 
Now we treat the case where s 4 0 andt 4 0. Suppose s 4 0 ¢ t. Next we have 
to consider several subcases. 


Subcase 1. Suppose s = t. Then, from (17.54), we get 
A(tx + ty) + Ox) + w) = Alty + tx) + 60) + w@). 
Hence, we have (x) = w(x) — 0, and from (17.53) we obtain 
f(x) — f(y) = @ — yx + ty) + wx) + wy) - 4). 
From Result 17.21, we obtain 
f (x) = 3ax* + 2bx? +x? + (d+ 2f)x +a = g(x), 
b(x) = 2ax? + bx* +x — A(x) +B + 7 


0 
y(x) = 2ax? + bx* + cx —-A(Qx)+f--= 


3° 
h(x) =a (*) +b Cy +A (=) +d, 
t t t 
where a, b, c, d, a, #, and 6 are arbitrary constants and A an additive function. 
Subcase 2. Suppose s = —t. From (17.54), we have 
A(ty — tx) + 6%) + wy) = h(t — ty) + 0) + w@) 
for all x, y € R. This in turn yields 
h(tx — ty) —h(ty —tx) = H(x~) — H(y), 
where H (x) := $(x) — w(x). Letting x = 0 in the above, we observe that 


h(-ty) —h(ty) =d— H(y), 
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where d = H (0). From these, we have 
A(x—y)+d=H(x)+d-—H(y)-d; 
that is, H (x) + d is additive on the set of reals. Hence 
w(x) = P(x) + Ao(x) —d, 
where Ag : R — R is an additive map. Substituting this into (17.53), we get 
fe) — FO) = & —A)IhAGy — tx) + 6) + 609) + Aoly) — dl, 


which is 
F(x)- FQ) =@— y)[K(tx — ty) + ®@) + O()], 


where 
1 1 
FQ) =f@)+td, K(e)=hC-tx)— 5Ao(x), O) = d(x) + 5400). 
Thus, from Result 17.21, we again have the asserted solution 
f@= 2ax? + cx? + (28 —d)x —A(x)+ a= g(x), 
1 
P(x) = 3ax? + cx — zAot) + £, 
1 
w(x) = 3ax? + cx + 7 Ao) + f-d, 
x Bs 


raya -o(f) fa) faut, 0 


where a, b, c, d, a, and f are arbitrary constants and A, Ag : R > R are additive 
functions. 


Subcase 3. Finally, suppose s? 4 17. Letting y = 0 in (17.54), we get 
(17.55) h(sx) + d(x) + wO) = h(x) + 6) + w(x). 
Letting (17.55) in (17.54) and simplifying, we have 
(17.55a) h(sx + ty) —h(sx) — h(ty) = h(sy + tx) — h(tx) — h(sy). 
Interchanging tx and sx in (17.55a), we obtain 

h(tx + ty) —h(tx) — h(ty) = h(sy + sx) — h(sx) — h(sy). 


Replacing x by + and y by 2 in the above, we get 


A(x t+y)—h(x) -—hQ) =h (“) =i ia =a (+) 
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which, by defining H (x) := h(x) — h (*), reduces to 
HA(x+y)=H(x)+A(y). 


Thus ee 
h(x) —h (=) = At), 


where A : R — R is an additive function. From this, we have 


(17.56) h(tx) — h(sx) = A(tx). 
Putting (17.56) into (17.55a), we get 
(17.57) h(sx + ty) — A(ty) = A(tx + sy) — A(tx). 


Substituting u = sx + ty and v = tx + sy in (17.57) and simplifying, we get 


nw) -a(“*) = hv) —a( 2 ) 
t= S f= 


for all u,v € R. Hence 


(17.57a) h(u) = A (. e ) 4d, 


—s 
where d is a constant. From (17.55) and (17.57a), we obtain 
(17.57b) (x) = wx) + Alte) + B, 


where f := $(0)— yw (0). Putting (17.57a) and (17.57b) into (17.53) and simplifying, 
we get 


G G 
POSE GESG—s pene) 


2 
where 


2 
t 
(17.57c) F(x) = f(x) — fx —dx and G(x) =2y(x)+2A (=) 
—s 
Hence, by (17.57b), (17.57c), and Result 17.17, we have the asserted solution 
f(x) =ax*+ b+ B+d)x+c= 8), 


b 
oy =ar-a(- Jeoed. 


STX 


=S 


where a, b, c, d, and f are arbitrary constants and A : R — R is an additive 
function. 


Since no more cases are left, the proof of the theorem is now complete. 
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17.6.2 Determinants 


Let X € Minn (F), where F = R or C. Denote the jth column of X by x; and the 
ith row of X by x‘, so that X can be written as 


X= (xij =([*1,...,4n] = Gel sce), 


The well-known Binet-Cauchy theorem for matrices X € Minn(F), Y € 
Mnm(F) takes the following forms: 


(i) Ifn <m, then det(XY) = 0. 
(ii) If n = m, then det(X Y) = det X det Y. 
(ii) If n > m, then 


det(XY) = by det[x;,,..., x4, ] det(y*!,..., y*). 


1Skj S--Sky <n 


The fact that the determinant is a multiplicative function on the set of all matrices 
of order n turns out almost to characterize determinants; still it does not characterize 
them completely. It is shown in [500] that the general Binet-Cauchy theorem (iii) 
characterizes the determinant completely. More precisely, the following theorem is 
true. 


Theorem 17.23. (Kucharzewski [546].) Jf f : Mm(R) — R is a nontrivial function 
with the property that 


(17.58) ‘Hs SS (Cente ..F*) 


1<k, <-+-<km<n 


holds for any matrix X € Mnn(R) (m <n < m+ 1) and any matrix Y € My (R), 
then 


f(C) = detC 
is true for any C € M,,(R). 


Proof. Two proofs are presented in [546], and we present the following one. For 
n =m, from (17.58) it follows that f is multiplicative; that is, 


(XY) = f(X)FW), for X,Y € Mn(R). 
According to Results 2.11 and 2.12, we get 
f(C) = M(det C) 


for every matrix C € M,,(R), where the function M is multiplicative (M) on R. 
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Now, take 
P+ O90 aa ae 
01---00 
X= | 2: a and Y= 
Sona C004 
00-:--0Os 


and observe that det([X;,,..., Xk,1) (i < ki < +++ < km <n) equals ¢ if ky = 
1l,...,km = morky = 1,...,km-1) = m—1, km = m+ 1. Otherwise, this 
determinant vanishes. Using this, (M), and (17.58), we find 


f (XY) = (m — 1)M(0)? + M(t)M(1) + M(t) M(s). 
On the other hand, det(XY) = t + ts together with (M) gives 
(17.59) M(t +ts) = (m—1)M(0)+ M(t) + M(ts). 


If in (17.59) we set t = s = 0, we get mM (0) = 0; ie., M(0) = 0. 
Replacing s by ¢ in (17.59), we get that M is additive. M being additive and 
multiplicative on R implies (see Proposition 1.12) 


M(t) =t. 


Thus f(C) = M(det C) = det C for any C € M,,(R). 
This proves the theorem. 


Corollary. For every real matrix X of order m, we have det X' = det X, where X' 
denotes the transpose of X. 


Proof. For C € M,,(R), set 
f(C) = detC’. 


It is easy to see that f satisfies all conditions of Theorem 17.23. For example, 
for matrices 


il 1 
X= [X1,.--,Xm,Xm41], Y= (y Oe ae or De 


we have 


f (XY) = det(XY)' 
= det(Y'X’) 


ISky <++<km <m+1 | Vkym ++ * Vkq_ im Xkm °° XmKkm 


D>, fC DIC a). 


1<k <+++<kyp<m+1 


II 
iy] 
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Since f(J) = detI’ = det 7 = 1, we conclude that f(C) = detC. Thus detC = 
det C‘ for any C € M,,(R). 
This concludes the proof of the corollary. 


Remark. The Binet-Cauchy theorem can be proved by straightforward computations 
using determinant functions. A determinant function A(x1,..., Xm) 1s a function 
of m vectors defined on an m-dimensional vector space V subject to the following 
conditions: 


(i) A is linear with respect to the first argument; i.e., 
A (ax + a’x},X2, shige: »Xm) = aA(x1, x2, shies: »Xn) - a’ A(x}, x2, oes »Xm)s 


xi eV. 
(ii) A is skew-symmetric with respect to all arguments; i.e., 


AC... Xk Xkt1y +) = HAC... X41, %%,---) for] <k <m. 


For more characterizations of determinants, see Gaspar [301]. 


17.7 Digital Filtering 


Computer vision is a relatively new and fast-growing field that primarily aids com- 
puters with artificial sensory perception. In other words, computer vision deals with 
image understanding. Preprocessing is a basic task in every image understanding 
problem. Among other things, preprocessing involves filtering. Spatial filters are 
used for deblurring, smoothing, sharpening, and enhancing images. Thus filtering is 
a technique for modifying or enhancing an image. 

A digital image is a function f : Zn, x Zn, — Ze, where nj, nz, and ¢ are 
positive integers. Here ¢ is a parameter that indicates the number of gray levels used 
in representing an image. If the function f denotes an image, then a spatial filtering 
of f would be 


(17.60) 
f@,y)=f@+ty+t+f@—-t,y)+ fa,y—t) 
f(x t,y t) fax,y+t)—f@t+t,y) 


for all (x, y) € Zn, x Zn,. The effect of this filter on a test image is shown in the 


figure below. The variable f is preset to 1. The filtered image f retains certain 
information contained in the original image. 


Figure: Original (left) and filtered (right) images of Saturn 
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A particular filter rarely produces good results for all images. For some images, 
a filter will produce a better result in terms of retaining certain basic information. 
For other images, it may not yield better results. Thus it will be useful to know on 
what kinds of images a particular filter gives the worst result (that is, it makes the 
image completely dark). To determine the class of such images with respect to the 
filter described in (17.60), one encounters the functional equation 


fa+t,y+t)+f@—t,y)+ f@,y—-t) 


(17.60a) 
=f@=—tiy -—H+/@,y+D+/044,y) 


for all x, y,t € Z,. The goal now is to study the functional equation in an abstract 
setting. 

The functional equation (17.60a) also appeared in [513]. In that paper, it was 
shown that if the quadratic polynomial f(x, y) = ax? + bx +cy*+dy +exy+a 
with a £ 0 and c # 0 is a solution of the functional differential equation 


f@th, y+k)—f (x,y) =hfr(x+0h, y+Ok)+kfya~t+Oh, yt+0k), O<0<1, 


for all x, y,h,k € R with h? +k? # 0, then? = 5. Conversely, if a function 
f satisfies the equation above with 6 = 5; then the only solution is a quadratic 
polynomial. Here f; and f, represent the first partial derivatives of f with respect 
to x and y, respectively. 


To establish this result in [513], the authors used the functional equation 


iO FOS IO Hy) +I ey =D) 


17.60b 
MEEO0n) = f@=—s,y-)+f@.y+D+f@+5,y) 


for all x, y,s,¢ € R. Letting s = f in the equation above, we have the previous 
equation (17.60a) holding for all x, y,t € R. Since (17.60b) implies (17.60a), the 
solution of the functional equation (17.60b) can be obtained through the solution 
of the equation (17.60a). The general complex-valued solution of (17.60a) when 
(17.60a) holds for all x, y, and ¢ in a 2-divisible Abelian group G is given in the 
following proposition. 


Proposition 17.24. The function f : G x G — C satisfies the functional equation 
(17.60a), where G is a 2-divisible Abelian group if and only if 


(17.60c) f(x,y) = Ba, y) + O@) + wy) + x — y) 


holds for all x, y € G, where B: G x G > Cis biadditive and ¢,y,4:G—>C 
are arbitrary functions. 
17.8 Geometry 


Let M, be the family of all straight lines in the plane IR? that are parallel to the 
y-axis and of all curves of the form y = p(x + a) + f, where p is a fixed function 
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and a, / run over R. In Faber, Kuczma, and Mycielski [265], it is proved that if 
there exists a continuous bijection of R* onto R? that induces a map of the family of 
all straight lines onto M », then p must be a polynomial of degree 2. 


Proof. We present the proof in [265]. Since the set {(x, y) : y = p(x)} 1s the image 
of some straight line, by the homeomorphism ¢, the function p must be continuous. 
It is shown in [325] where this result is proved under the additional condition that p 
is differentiable, that there exists a constant r such that g(x) = p(x +71) satisfies the 
functional equation 


q(x) + g(t) —q(« +t) —q(0) = (0) — gq +2) — 4g) + 4). 


Neither p nor q can be of the form ax + b. In fact, if p(x) = ax + b, then M, 
consists of all vertical lines and all lines with slope a. But then not every pair of 
points of IR? lies on one curve of M p- 80, M, could not be isomorphic to the 
family of all straight lines. 

Since the left side of the equation above is symmetric in x and f, interchanging 
x and ¢ on the right side and then putting x = | yields 


qd) —qU +t) —q()+ q(t) = 2ct 


for some constant c. Thus 


q(x) + q(t) — q(x +t) — q(0) = 2ctx 
sel +1) —x°— 27), 


which is additive in A(x) = g(x) — q(0) + cx”. Thus g(x) = —cx? + ax 4+ d. This 
concludes the proof. 


Let VV; consist of all planes parallel to the z-axis and all surfaces of the form 
c= pat+ayt+f)+y, 


where p : R* — Randa, £, and y are real constants. In [265], the following 
problem was raised: Characterize those functions p for which NV’, is continuously 
isomorphic to the family of all planes in R*. This problem is solved through a func- 
tional equation and shows that p is a polynomial of degree 2. 

Let P denote the family of all planes in R*. An isomorphism from P onto N, pisa 
bijection of R? that induces a bijection from P onto Np. Similarly, an automorphism 
of P is a bijection of R? that induces a bijection from P onto P. 


Lemma. Every bijection ¢ of R? onto itself that induces a bijection of P onto itself 
is affine linear; that is, 


P(x, y,Z) = (aaxt+byy+c1z +u, ax + boy +022 +0, 43x + b3y +.43z + w), 
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where 
ay by cy 
det | az bo c2 # 0, 
a3 b3 c3 


and a;, bj, and cj (i = 1, 2,3) are arbitrary constants. 
This result is known as the “Fundamental Theorem of Projective Geometry”. 


Result 17.25. Let p : R? — R. There exists a continuous isomorphism 6 from P 
onto Np if and only if p(x, y) = dx? + doy” + 2d3xy + dax + dsy + do with 
ddz — d3 + 0, where dj (i = 1,2,..., 6) are real constants. 


17.9 Field Homomorphisms 


We treated the problem in Chapter 1 whether for A : R — R additive A(x)A (+) > 
0 for x € R* implies the continuity of A. In [131], it is claimed that if an additive 
A: K — K (K afield with char(K) ¥ 2) satisfying A(x)A (+) = | forx € K* is 
a field monomorphism. Halter-Koch and Reich [337] considered the characterization 
of additive functions satisfying a functional equation of the form 


A ax+b\  aAQ)+fP 
(=**) ~ pA(x) +6" 


(17.61) 


Result 17.26. [337]. Let K and K be fields of characteristic different from 2, d\ € 
K*, di € Kk, and let A: K — K be an additive function satisfying 


d d 
(17.61a) A(—)=—  porall x € K\ker(f). 

x A(x) 
Then either A = O ore = A(1) ¢ 0, e-!A: K — K isa field monomorphism and 
d; =ef(d). 
Theorem 17.27. [337]. Let F be a field of characteristic different from 2, #F > 5, 
and let K and K be extension fields of F. Suppose that 


ab ap 
(: 2 € GL, K), 6 ) éGLOK), 


and let A: K — K be an F-linear map satisfying the functional equation (17.61) 
forall x € K such that cx +d 4 Oand y A(x) + 6 £ 0. Then one of the following 
three assertions holds true: 


(i) A =Oand Bb =0. 


(ii) c=y =0, A (4) = gE and A (Sx) = 5A(x) forallx € K. 
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(iii) cy £0, AU) =e £0, e-!A: K — K isa field monomorphism, and there 
exists some 2 € K™ such that 


a B\ _ (de A(a) Ae? A(b) 
y OJ \ AA(c) AeA(d) )° 


Proof. We consider the case cy # 0. We may assume that c = y = 1, and then we 


must prove that 
aPp\ — (A(a) cA(d) 
lo} 1 A(d))° 


If x € K is such that x +d 40, and A(x) + 6 ¥ 0, then (17.61a) implies 


d\ d\ 


Putting y = x +d, we obtain, for all y € K* such that A(y) — B(d) + 6 £0, 


dy dy 
a@—a(4) =a AQ) —A@) +0 


If F, = {A € F* | AA(yo) — A(d) + 6 S$ O},—" then #F > 3, andif y = Ayo for some 
A € Fj, then 

1 (ad d\ 
A(a)-— —A (7) =a- — 
A \yo AA(yo) — A(d) +. 6 


which implies 


A(yo)(A(a) — a) A? + aw + 6)(A(a)—a)—A (- ) A(yo) + a A 


= 
YO 
d 
+A (*) (A(d) — 6) =0 
YO 
for all J € F,. Hence, we obtain 
A(yo)(A(a) — a) = 0, 
d = 
(-A(@d) + d(A@-a)—A (*) A(yo) + di =0, 
d 
A (*) (A(d) — 6) =0, 
yO 
and consequently, since A(yo) 4 0, 
A(a) =a, 


d _ 
A (=) A(yo) = 4d #9, 
Yo 
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and hence 
A(d) =o. 


Therefore A satisfies the functional equation (17.61a) for all y € K\ ker(f). Hence, 
by Result 17.26, e~!A : K — K isa field monomorphism, d; = ef (d1), and using 
a= f(a), 6= f(d), we obtain also 6 = ef (b), which finishes the proof. 


Corollary. [f, under the conditions of Theorem 17.27, we assume in addition 
cy #0 and A()=1, 


then an additive function A : K —> K satisfies (17.61) if and only if A is a field 
monomorphism. 


A Generalization of (17.61) 


Result 17.28. Let K be a field containing Q, n € Z., n = 2, 


ab 
é 4 €GL(,K), 


and eitherc = Oorc £0, (c-!d)* = a" for some a € K. Let A,, A2: K — K be 
additive functions such that A is injective and 


(= + 7) _ aAi(x)" +b 


17.61b = 
( ) cx" +d cA, (x)" +d 


holds for all x € K such that cx" +d #0 and cA,(x)" +d 4 0. Then Ay(1)7!Aq : 
K — K isa field automorphism. 


17.10 Pythagorean Functional Equation 


We consider now the functional equation 
(17.62) FO? +y)= FY + fOY, 


known as the Pythagorean Functional Equation, where f stands for a map of the 
finite field F, of g elements (¢ = p” for some prime) into itself. As multiplication 
in F, may be obtained from both squares and addition by polarization whenever the 
characteristic p of Fg is not 2, we obtain as nontrivial solutions of (17.62) exactly 
the Frobenius automorphism together with its powers if p 4 2. The two functional 
equations required for a field isomorphism (one for addition and the other for mul- 
tiplication) may be simplified to just one: functional equation (17.62). But, for the 
sake of completeness, we also include the solutions of (17.62) when p = 2, which 
turn out to be simpler to find out due to the fact that in this case the map x +> x? is 
bijective (the Frobenius automorphism). However, the results for p 4 2 no longer 
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hold if p = 2, as one expects from the fact that the polarization identity requires 
dividing by powers of 2, and thus the multiplicative structure is not recovered from 
squares and addition. 

The solution of (17.62) is obtained via the solutions of the restricted Cauchy 
functional equation 


(17.62a) FQ? +y?) = fa?) + FO?) 


for f : Fy — F,. Equation (17.62a) is considered here because there is a trivial, but 
useful, connection between (17.62a) and (17.62), which allows us to proceed solving 
first (17.62a) and then (17.62). 


Result 17.29. (Garcia-Roig and Salillas [300]). Let p be an odd prime number and 
q = p", n = 1. The functional equation (17.62a) for f : Fg — Fg has exactly q" 
solutions of the form 


2 n—1 
f(x) = ax + apx? + a,x? ita gaia” : 


where a, Ap, ..+5@,n-1 are in Fg. 

Theorem 17.30. [300]. Let p be an odd prime number and q = p", n > 1. The 
functional equation (17.62) for f : Fg — Fg has exactly n + 2 solutions, namely the 
constant functions f =0, f = 5 and the powers f (x) =x? (i =0,1,...,n—1) 
of the Frobenius automorphism of F. 

Proof. Substituting x and y by zero in (17.62), we obtain f(0) = 2 (0)? so that 


f@) =0or, if p > 2, FO) = +. 
Let us assume first that f(0) = 0. Then, setting y = 0 in (17.62), we see that 


fa =7ay 


and thus (17.62) entails (17.62a), so that the solutions for (17.62) are to be found 
among those described in Result 17.29. Now taking a solution f(x) induced by the 
(reduced) polynomial 


1 


F(T) = aT +apT? tayT” +---+ay1T? 


of (17.62a) with a}, ap,a » A yn-1 in Fg, let us impose what amounts to 


p2r++9 4p 
F(T’) = F(T)? 


since both polynomials are already reduced. Expanding, we get 


n—1 

ayT* + pT? +--+ + ayn-1T7? 
n—1 i i 
Sail? +a, tesa TP +2 >) aay tl, 


O<i<j<n 
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and from this we conclude that there cannot occur two (or more) nonzero coefficients 
in p(T): Ifi < j anda,ia,; 4 0, then the term of degree p’ + p/, which appears 
to the right of the equation above, does not appear on the left (observe the argument 
would fail in characteristic 2). Thus p(7) is monomial, say apyiT? (0 <i <n). But 
then, going back to the equation above again, we see that a, = Api; 1.€.,a4,i = O or 
1 (which actually yield solutions of (17.62)), and this concludes this case. 

Assume now that f(0) = 5. In this case, obviously f(x*) = f(x)? + 7 and 
if we set g(x) := f(x) -— 5; then f being a solution of (17.62) entails that g(x) 
satisfies (17.62a). This g may be thought to be induced by the polynomial g(T) = 


aT + apT? +--+ + ayn-1 TP" of F,[T]. But now q(T) is subject to the relation 


Pp 


q(T?) = q(T +4(7) 
since 
e007) = f0)— 5 = fe) + 7-5 = FOP -4 


= 1 . te 2 
=(«) +5) = Bis) + g(x). 


Expanding the equation above and proceeding in a similar way as with the case 
f (0) = 0, we immediately obtain first that g(7) must be a monomial and then that 
it must be the zero polynomial. This proves the theorem. 


The Case of Characteristic 2 


This case turns out to be simpler than that of the odd characteristic (and note that the 
preceding methods are still applicable). Precisely, for (17.62a) there is an immediate 
argument, and then the further step to get the solutions of (17.62) is straightforward. 


Result 17.31. [300]. The functional equation (17.62a) for f : Fan — Fan, where 
q = 2" (n > 0), has q” solutions, namely the polynomial mappings 


n—1 


2 
f(x) = ax + apx? +a,2x?P fs 9 sb gnats” > 
where a1,dp,..-, Apyn-1 are (arbitrarily chosen) in Fy. From these, the 2" maps 


where the coefficients a, dp,...,Apn-1 are required to lie in the prime field F2 are 
all the solutions of the functional equation (17.62). 


17.11 Statistics 


This section is devoted to characterization of some statistical distributions such as 
the Poisson, gamma, and normal distributions. 
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17.11.1 Poisson Distribution 
17.11.1.1 Characterization of (Bivariate) Poisson Distributions 


Suppose that items of two types have been observed a certain number of times, but 
these original observations have been reduced due to a destructive process that is 
the product of two binomial distributions and that the probabilities of these reduced 
numbers are the same whether damaged or undamaged. Then the original random 
variables had a bivariate Poisson distribution with zero mutual dependence coeffi- 
cient. 

Suppose m and n are the numbers of items of two types observed originally. 
Suppose further that these original observations are reduced to r and s due to a de- 
structive process that is the product of two binomial distributions; i.e., the probability 
that (m,n) is reduced to (r, s) is 


m = n = 
( ora py" (aa =a *, (p,q € Io). 


r 
Suppose finally that for the resulting distribution 
P(r,s) = P(r, s | undamaged) = P(r, s | damaged). 


Then the original random variables had a bivariate Poisson distribution with zero 
mutual dependence coefficient (that is, they had the distribution of two independent 
random variables obeying Poisson laws). 

In Skitovich [748], this problem is reduced to find the solution of the functional 
equation 


(17.63) G(pz+1—p,qwu+1—4q)G(p,q) = G(pz, qu) 


concerning the joint generating function G of the original random variables. 
Theorem 17.32. [59]. Let G ¥ 0 satisfy the functional equation (17.63) for all 
P.¢ € 10, 1[, z, w € |] — 1, 1[. Then, and only then, 

(17.63a) G(z, w) = exp[A1(z — 1) + Ao(w — 1)] 


forallz,w €]—1,1[, where A, and A2 are arbitrary solutions of (A) on {(z1, Z2) | 
-l<z <0, —2 <z <0, —2 < 2z14+ 22 < O}. 
Further, if G is continuous in a point (of | — 1, 11), then, and only then, 


(17.63b) G(z, w) = exp[/1(z — 1) + 42(w — 1)) 


holds with arbitrary constants 2, and A. 


Proof. The “only then” statements are obvious by direct substitutions. In order to 
prove the “then” parts, introduce new variables in (17.63). Setx = p—1, y= 
q-1, u = pz—p,andv = qgu—q,sothatp=x+1l1,q=yt+l1,z= 
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ss =yitlwe= ut = sit 1. (p # O and gq F 0 by supposition, so 
x 2% =1, y #=1), and 


pz=u+pH=utx4l1, qu=vt+q=o+yH1. 


Then (17.63) goes over into 
(17.63c) F(x, y)Flu,v) = Fx +u,y+vd), 
where 
F(lu,v) =Gu+1,v0+1) 
for p,g € Io, z,w €]—1,1[, x, y € ] —1, O[ and uw and o through | — 2, O[, while 
2x +u = p(z+1)—2and2y+v0=q(w + 1) — 2 are restricted to |] — 2, O[. 


The general, not identically zero, solution of (17.63c) is of the form (17.63a) (see 
[12, Section 5.1.1]). This proves the theorem. 


17.11.2 Normal Distribution 
(i) (Kac [408]). Suppose ¢ : R = C satisfies the functional equation 
(17.64) d(x) = V2a0d¢(ax)h(bx), forx ER, 


with [ #(x)dx = 1, and ¢’(0) exists where o,a,b > 0 and a? + b” = 1. Then ¢ is 
a normal distribution 


I cies 
(17.65) p(x) = V2a-—e-*/* forallx ER. 
Oo 


(ii) (Baker [93], Laha and Lukacs [593]). If we set f(x) = V/2a0¢(x) for x € R, 
then (17.64) is equivalent to the functional equation 


n 


(17.64a) f(x) = [LPG 


j=l 


studied in [593] to characterize normal distributions, withn = 2, fj =a, fo = 
b, yp =1=y2, and By; + Bry2 =i. 
Let F : R — R be a distribution function that is a nondecreasing, right 
continuous function such that ; lim F(t) = 0 (F(—oo) = 0) and jim F(t) = 
—>—00 00 


1 (F(+00) = 1). The Fourier-Stieltjes transform 


[o,@) 
f(t) = } el" F(x) 
=00 
of F(x) is called the characteristic function of the distribution function F (x). A char- 
acteristic function is always a continuous and positive definite function (in the sense 
of S. Bochner) and f(0) = 1. A characteristic function f(t) is said to be infinitely 
divisible if for every positive integer n there exists a characteristic function f,(t) 
such that f(t) = [f,(t)]”. 
In [593] is proved the following result. 
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Result 17.33. Suppose that a characteristic function f (x) satisfies the functional 
equation (17.64a), where 0 < Bj <1, yi > O@ = 1 ton). Then (i) when 


n 
> Ain =1 
i=1 


and the first moment of f(x) exists, f is infinitely divisible; (i) the characteristic 
function g(t) = f (t)f (2) is infinitely divisible; and (111) when 


n 
> By =1, 
i=1 


f is the characteristic function of a normal distribution with zero mean. 
The following results are used in proving Result 17.33 regarding (17.62a). 


Proposition (i). Suppose f : Ri, > R*.. Then (17.64a) holds for all x > 0 if and 
only if there exists a function ¢ : R > R such that 


(17.66) f (x) = exp[x*A(logx)] for all x > 0; 
L.é., 


d(t) =e log f(e') forallt €R, 


and 


(17.66a) o(t)= > ujo(t— pj) forallteR. 


j=l 
Proposition (ii). Suppose ¢ : R > R (or any normal linear space), (17.66a) holds, 
and lim ¢(t) exists. Then is constant. 
t>—00 


Proposition (iii). Suppose f : Ri. > R%_, (17.64) holds for all x > 0, 


n 
Doe) 
j=l 


(equivalently, k > 0), and _ f (x) exists. Then either f =O or f(x) > 0 forall 
x04 
x > 0. Moreover, assuming f # 0, ifk = 0, then f is constant, and ifk > 0, then 
lim f(@)=1. 
x04 


Theorem 17.34. [93]. Suppose f : R > Rx, (17.64a) holds for allx € R, k € Z*, 
and f  (Q) exists. Then either f = 0 or there exists c € R such that 


(17.67) f(x) =exp[cx*] forallx ER. 


17.11 Statistics 721 


Proof. The proof is based on induction. Since f’(0) exists, f is continuous at 0. 
By Proposition (iii) (applied to the restriction of f to (0, +00)), either f(x) = 0 for 
all x > Oor f(0) = 1 and f(x) > 0 for all x > 0. Assume the latter. 

Let F(x) = log f(x) for x € R. Choose ¢ : R — R such that (17.66) and 
(17.66a) hold. 

Then F(x) = x*¢(log x) for all x > 0, F(O) = 0, and F’(0) = f’(0)/f(0) = 
f'(0). 

Suppose k = 1. Let 


F)- FO) _ 


c= F’(0) = lim = lim ¢(logx) = lim (ft). 
x04 t—+>—0o 


x04 x 


By Proposition (ii), @(t) = c for all t € R. Hence F(x) = cx for all x > 0 and 
thus f(x) = exp[cx] for all x > 0. The continuity of f at 0 thus implies that 
f (x) = exp[cx] for all x > 0. Let g(x) = f(—x) for x € R. Then 


gx) = f(—x) =] F@iCx))” = [] Gia” 


j=l j=l 


for all x € Rand g/(0) = —f'(0) = —c. Hence g(x) = e~™ for all x > 0. That is, 
f (x) = e™ for all x < 0. Thus our assertion is true when k = 1. 

Now suppose that k > 2. By assumption, there exists ¢ > 0 such that f@—) (x) 
is defined for all x € (—e, e). Hence 6“—")(t) is defined for all t € (—0«, loge) and 
hence, by (17.66a), for all t € R. It then follows from (17.66) that f(x) and 
F&-D (x) are defined for all x > 0. Now, for all x > 0, 

F(x) = xp (logs) 
and 
F'(x) = kx*"!@ (log x) + x‘! (log x) 


or 


F'(x) = x*"'{p' (logx) + kp (log x)}. 


For a € R, let a + a denote the linear differential operator defined by 


([5+4| v) ()=yw'@+ay(t), forallt eR, 


whenever y : R > Ris differentiable. Then 


F'(x) =x"! (Es + | é) (logx) forall x > 0. 
Xx 
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Similarly, if k > 3, we have, for all x > 0, 


F"(x) = x*-? (E Af 1 E es | 2) (log x) 


d d 
k-Diy) = me ae 
F w=([4 +2] E +4] 2) (log x). 


Let y=[4+2]---[44+4]¢. Then 
yw =p") + ay_rb*? +++» + aid! +.a0d, 
where do, 41, ...,4x—2 € R, are such that, for all z € C, 


k-2 


ZT 4 ay zk? +. +z t+ a0 = (+ 2)-+-(@ +4). 


Notice that a9 = k! It then follows from (17.66a) that 


N 

(17.66b) y(t) = >) ujw(t— pj) forallt eR. 
j=l 

We also know that 

(17.68) F&-Y(x) =xw(logx) forall x > 0. 


Now 


F(x) = log f(x) = > yj log f(Bjx) = >) 7j)F (Bix) forallx ER, 


j=l j=l 


so that 

n 
F'(x) = > Bjyj F'(Bjx) forallx ER. 
j=l 
Hence 
n 
FO) = 4 > Brit FO. 
j=l 
But 


n n 
> fie > bin =1 
j=l j=l 
since k > 2, so F’(0) = 0. If k > 3, then 


n 
F"(«) = > Biv) F" Bix) for allx €R, 
j=l 
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so that F’’(0) = 0 since 
N N 
DAiVi > DBi7s = 1 
j=l j=l 


Continuing this argument, we find that 0 = F (0) form = 1,2,...,k —1. 
But F“ (0) exists since f (0) exists and we know that F“—)(0) = 0. Hence, 
by (17.68), 


F&-YD(x) — FEYDOO 
F®(0) = lim cialis 2 Rate eal |) 
x—0t x 


= lim wilogx). 
x—0T 


By (17.66b) and Proposition (ii), y is constant, say y(t) = A € Rforallt € R. 
That is, for all t € R, 


PMO) apd 7) + $ad'O+ KDE = 
Hence there exist real constants c2,..., cx such that 


b(t) = coe 2 +--+ + cge + (A/k!) for allt ER. 


Thus 
F(x)= x* (log x) 
— yk] 2 
= [8+(8) 
A k —2 
— ii x*+cK. +: hax” forall x > 0. 
Since 0 = F’)(0) form = 1,2,...,k — 1, we have cy = --- = cx = 0. Thus, if 


c=A/k!, then F(x) = cx* for all x > 0 and hence, for all x > 0, 
(17.67) f (x) = exp[cx*]. 


Arguing as we did in the case where k = 1, we find that (17.67) holds for all real x. 
This proves the theorem. 


(iii) [144, 12]. This is characterization of the normal distribution by an important 
property. We try to find a probability density function of the form f(x — a), where 
a is the parameter, such that the maximum likelihood estimate of “a” is the mean 
value of the sample. This implies that the maximum of the functions 


[] fai -@ > Slogif@i - a] 


i=l i=1 
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must be attained at the sample mean. Thus, we must have 


n n n 

Xp tx. +x 
> i (« = wee) = == a h'(yi) =0, 2 yi = 0, 
I= i= i= 


where h(x) = log[ f (x)]. 
Substituting y; = 0 for alli = 1,2,...,n, we get h’(O) = 0, and substituting 
y; = 0 fori = 2,3,...,n — 1, we geth’(y) = —h’(—y). Therefore, we have 
h'(y1 + y2+++++yn-1) = h'(—yn) 
_ —h' (yn) 
=h'(y1) +A'(y2) +++ +A On-1), 


which is equation (A). Thus, h’(x) = cx and h(x) = cx?/2 + b. Consequently, we 
have f(x) = exp(cx*/2 + b). If we now impose the normalization condition 


[ fe-aar= 1 


we get 
(17.658) fw -a) = — 
65a x—-ay= exp | —-——~—],_ —00 <x <0, 
oV2n 20° 
which is a normal family of distributions. Note that c = —1/o?. 


(iv) Now we turn our attention to the functional equation 


(17.69) f@)g(y) = h(ax + by)k(cx + dy) 


forx,y €R, f,g,h,k:R—C, a,b,c,d € R4, and its generalization 


n 
(17.70) fxg) = | [riG@x + biy), 
i=1 

where f, g,hj : R- C, aj, bj € Ri with ajb; —ajb; #0, i, j A ton)i # j. The 
functional equation (17.69) has applications to the quantum mechanical three-body 
problem and to the characterization of normal distributions. 

At the eighth annual meeting on functional equations held at Oberwolfach in 
1970, the following two problems were raised that are special cases of (17.69). 


Problem (R.H. Hall). Find the general, even, square-integrable real solutions of the 
functional equation 


(17.691) f@®)gO) = (5 + 3s) (> = *). 


This has applications to the quantum mechanical three-body problem. 
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Problem (J. Aczél). Find the general continuous real solution of the functional 
equation 


(17.69ii) f (x)g(y) = flax + by)g (cx + dy) 
(a, b, c, andd real constants). 


At the same meeting A. Lundberg found the most general logarithmically differ- 
entiable solution of (17.691). In [334], (17.6911) was solved assuming f and g to be 


é ; : : : ; ab : 
twice continuously differentiable real functions and assuming al to be a rotation 
matrix. Lundberg also noted that if f and g were continuous solutions of (17.6911) 

: : : : ab 
that were not identically zero, then they were never zero, again assuming a to 


be a rotation matrix. 

Now we consider the functional equation (17.69) where f, g, h, and k are 
complex-valued functions of a real variable and a, b, c, and d are fixed nonzero 
real numbers such that ad — bc £ 0. Let P(x, y) = f(x)g(y) forx,y ER. 

If, in addition, P is Lebesgue measurable, then f, g, 4, and k are continuous. 
Using this and a theorem of Kemperman [529], we determine (Theorem 17.36) the 
continuous solutions of (17.69). 

The functional equation (17.34) was treated in [524], and this is also solved under 
weaker regularity conditions in [91] than were assumed in [524]. 


17.11.2.1 The Equation f (x)g(y) = h(ax + by)k(cx + dy) 


Result 17.35. (Baker [91]). Let f, g,h,k satisfy (17.69). If there exists a subset 
V of R? of positive Lebesgue measure such that P(x, y) = f(x)g(y) 0 for all 
(x,y) € V, then f (x)g(y)h(u)k(v) 4 0 for all x, y,u,v € R. If in addition, P is 
measurable, then f, g, h, and k are continuous. 


Theorem 17.36. [91]. Jf f, g, h, and k satisfying (17.69) are measurable and 
not almost everywhere zero, then there exist complex polynomials p, q, r, and s, 
of degree at most 2, such that f(x) = exp(p(x)), g(x) = exp(q(x)), A(x) = 
exp(r(x)), and k(x) = exp(s(x)) forall x €R. 


Proof. By Result 17.35, f, g, 4, and k are continuous and nowhere zero. 
Put x = 0 to obtain 
f(0)g (0) = h(O)k (0). 
Hence 
f@) gy) _ h@xt+by) kx +dy) 
f) g0)  ~—A(0) k(0) 


forallx,y €R. 
Letting log denote the principal branch of the logarithm in the right half plane 
and defining 
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F(x) = log(f ()/f )), 
G(y) = log(g(y)/g(0)), 
A (uw) = log(h(u)/h(0)), 
K (v) = log(k(v)/k(0)), 


for sufficiently small x, y, u, v € IR, we deduce from (17.69) that 
(17.69a) F(x) + GQ) = H(ax + by) + K (cx + dy) 


for all x and y in some neighbourhood of 0. 
Choose ¢ > 0 such that (17.69a) holds whenever |x| < ¢ and |y| < e. Integrating 
over [—é, €], we find, for |x| < ¢, 


2eF (x) +f G(y)dy = | (H(ex+by) + K(cx + dy)}ay, 


from which it follows, using the Fundamental Theorem of Calculus, that F is con- 
tinuously differentiable in a neighbourhood of 0. Similarly, G, H and K are contin- 
uously differentiable in a neighbourhood of 0. 

Differentiating (17.69a) with respect to x, we find that 


F'(x) = aH'(ax + by) + cK’ (ex + dy) 


for all sufficiently small x and y. Integrating this equation with respect to y over 
a suitably small neighbourhood of 0, we find that F” exists and is continuous in a 
neighbourhood of 0. Similarly, G, H, and K have continuous second derivatives in 
a neighbourhood of 0. 

Differentiating (17.69a) with respect to x and y, we find 


0 = abH"(ax + by) + cdK" (cx + dy) 
for sufficiently small x and y. Hence 
0 = abH"(u) + cdK"(v) 


for sufficiently small u and v so that H” and K” are constant in a neighbourhood of 
0. Similarly, F” and G” are constant in some neighbourhood of 0. 
In particular, there is a polynomial p such that F(x) = p(x), and hence 


(17.71) f(x) = exp(p()) 


for all sufficiently small x. 
Now, for any fixed x9 € R, 


Ff (xo) 8 O) = h(axo)k(cxo), 
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so that 
f@otx) gly) hlaxot+ax+by) k(cxo+cx +dy) 


fo) g@) h(axo) k(cxo) 
for all x, y € R. Now it follows from the equation above that f is analytic in a 
neighbourhood of x9. Thus, from the theory of analytic functions, we conclude that 
(17.71) is valid for all real x. Similarly, we obtain the required representations of 
g, h,andk. 


Remark. The continuous solutions of (17.69) are one of the following forms: 


Gg) f =0, h =0; gandk arbitrary. 
Gi) f =0, k =0; g andh arbitrary. 
Gii)g =0, h =0; f andk arbitrary. 
(iv)g =0, k =0; f and h arbitrary. 
(v) f(x) =a’ exp(ax? + Bx), 
g(x) =D’ exp(y x? + dx), 
h(x) = c' exp(ux? + vx), 
k(x) = d’ exp(px? +. 0x), 
for all x € R, where the constants a,b,c, d,a, B,..., are such that 


a'b'=c'd' £0, a’utc*p =a, but+dp=y, 
abu +cdp = 0, av+co =f, bv +do =o. 


Remark. There are measurable solutions of (17.69) that are almost everywhere zero 
but not identically zero. Indeed, let 


1 ifx=0, 


f@)= i ifx £0, 


and let f=g =h=k. 


The general solution of (17.69) is given in the following result. 


Result 17.37. (Lajko [600]). Suppose that the functions f, g,h,k : R > R satisfy 
the functional equation (17.69), where a,b,c,d € Ri are arbitrary constants with 


A = ad — bc # O and there exists a subset V C R? of positive Lebsegue measure 
such that f (x)g(y) 4 0 for all x,y € V. Then f, g, h, and k have the form 


f (x) = a exp[Ai(x) +. 11(x)], x ER, 
g(x) = az exp[A2(x) + n2(x)], x ER, 


Aw) =—fif (<x)e (-<x) , xel 


k(x) = fof (-*) g (++) , xel 


A 


A 
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where 
ai P\a2f2 = 1, 
ad bd be bd 
n(x) +n2(y) =m (4: +f =») +n (= 1 ~») 
ac be ac ad 
+712 (Ss + ~y) +2 (Ss + ») 


hold and the functions Aj :R > RG = 1, 2) satisfy (A) andn; :R > R@ = 1, 2) 
satisfy the functional equation (Q) (quadratic). 


n 
17.11.2.2 The Equation f (x)g(y) = [] Ai(ajx + bj y) 


i=1 
Now we treat the generalization (17.71). 


Result 17.38. (Baker [91]). If there exists a subset V of positive planar Lebesgue 
measure such that P(x, y) 4 0 whenever (x, y) € V, then 


f@egQ)hitu) 40 forallx,y,u€ Rand1 <i <n. 


If, in addition, f, g, andh; (1 <i <n) are measurable, then there exist polynomials 
DP.q,ri Ul <i <2) of degree at most n such that 


f(x) = exp(p(x)), (yy) =exp(q(y)), and hj(u) = exp(ri(u)) 
forallx,y,u € Randalli =1,2,...,n. 


Remark 17.39. If we set f(x) by f(x)g(x) and g(y) by f(y) g(—y), 41 = f, a1 = 
bj = 1, ho = g, ag = 1 = —), n = 2 (17.71) can be written as 


(17.72) f@+y)g@—y)= fa)fO)g@)g(-y). 


Now we consider (17.72) with motivation. 


(v) Suppose that u and v are independent random variables. If u + v and u — bv are 
independent, then uw and b are normally distributed. Functional equations are derived 
from this, and then we determine the most general solution of these equations, the 
regular solutions of which are normal distributions. 

At an international conference on “Inequalities and Functional Equations” held 
in Debrecen some years ago, in a conversation it was suggested by I. Olkin to 
C.J. Eliezer that the solution of the functional equation (17.72) would be of inter- 
est, as the equation arose in a problem on statistical distributions. Subsequently, 
the continuous solution of (17.72) was found in [254] (see also Van der Lyn [810], 
Olkin [654]). Here we are determining the most general solution of (17.72) and a 
connected equation (17.73) again connected to a normal distribution without assum- 
ing any regularity condition whatsoever. First we give the motivation for treating 
(17.72). 
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After obtaining the solution, Professors Eliezer and Olkin were contacted in April 
1991, and here is the gist of the reply from Professor Olkin in May 1991. The 
problem you refer to is this: Suppose that u and b are independent random variables. 
If uw + v and u — v are independent, then uw and v are normally distributed. 

The known proofs are based on characteristic functions (Kapur [525]). Here we 
obtain the solution of (17.72) by a simple and direct method without any assump- 
tions. 


Remark. The following assertion holds. If uj (@ = 1,2,...,m) are indepen- 
dent random variables and >* aju; and >° bju; are independent linear statistics with 
ajb; ~ 0, then u;’s are normally distributed (see Skitovich [749] and Kapur [525, 
p. 89]). 


The functional equation in the present case is derived as follows. Suppose that 
u and v have densities, and let x = “4*, y = 45*. The joint probability density 
function p of u, v is 


plu, v) = fil) fo(v) 


(this is the independence property); that is, p(x, y) = constant fi(x + y) fo(x — y). 
By hypothesis, x and y are independent, which says that (forgetting the constant) 


(17.73) fi@ + y) foe — y) = dix) d2(y), 


which will be treated in (vi). 
From (17.73), for x = 0, 


AiO) f2(-y) 


g2(y) = bi)” 
and for y = 0, 
_ fi) fo@) 
Pix) = dO) 


Consequently, (17.73) becomes 


Ai@+y)fraty)= AM AMA L(y) 


£1 (0)d2(0) , 
which can be written as 
(17.72) f@+y)gx —y) = f)FO)s@)g(-y), 
where 
foa= fi) i= fro) 


#1 (0)’ ~ ¢2(0)" 
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17.11.2.3 Solution of the Functional Equation (17.72) 


We first determine the most general solution of (17.72). 
Theorem 17.40. (Kannappan [473]). Suppose that f,g : R — Rv. satisfy the 
functional equation (17.72) for all x, y € R. Then f and g are given by 

1 
(17.72s) f(x) =ceIME(x), g(x) = —eI™ E'(x), 

c 
where E and E’ are exponentials satisfying (E), q is a quadratic functional satisfying 
(Q), and c is an arbitrary constant. 


Remark. The general solution of (17.72) is obtained without assuming any regular- 
ity condition on f or g. 


Proof. Note that neither f nor g can take zero value. With x = 0 = y, (17.72) gives 
f (0)g(0) = 1. So, without loss of generality, assume that f (0) = 1. 
Change y to —y in (17.72) to get 


(17.72a) fa-ygaty=fa)fCy)g@)g) forx,y eR. 


Notice the symmetric role played by f and g. Interchange x and y in (17.72a) and 
divide by the resultant equation to obtain 


E\(x) 
E ay = 
(E) E\(x — y) E10)’ 
where 
_ f@) 
(17.74) E\(x)= 7. 


From (E) there results E(x + y) = E1(x)E (y); that is, E; is exponential. 
Now, dividing (17.72) by (17.72a), we have 


Exa+y)  Ea(y) 


aca sa fi R 
b@=-7) Hey 
where 
(17.74a) n(x) = £2). 
g(x) 


Changing x to x + y in the equation above, we get 


Ex(x + 2y) = Ex(y) 
E(x) Ex(-y) 
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With x = 0, the equation above, gives 


E2(y) 


oe Ex(-y) 


(note that E2(0) = 1) so that 
Eo(x + 2y) = Eo(x)E2(2y). 


Thus E> is also exponential (E). Now, from (17.74a), we have 


BD EY) ey pS: 
g(x) g(—x) 
that is, 
f(x) f(—x) = g@)g(—x) forx eR. 
Set 
(17.75) T(x) = f(x) f(—*) = g(x)g(—x)  forx ER. 


Changing x to —x and y to —y in (17.72) and multiplying (17.72) side by side by 
the resultant equation and using (17.75), we obtain 


(17.76) T(x +y)T(« —y)=T(Q)T(y) forx,y ER. 


Noticing that T is positive and taking the logarithm on both sides of this equation, 
we have (Q) satisfied by g for x, y € R, where g(x) = log T(x). 
So, g is a quadratic functional and 


T(x) = f(x) f(—x) =e™™ forx ER. 
From (17.74) and the equation above, it follows that 
fx =eIME (x) forx ER. 
Since f and F; are positive, taking the square root on both sides, we have 
gx) 1 
f(x) =e7 Ei(x)?, 
which is the same as f in (17.72s). From (17.74a), we have 


g(x) = f (*)Eo(—x) =e“ Ey (x)? E2(—x), 


which is the same as g in (17.72s). This proves the theorem. 


Corollary 17.41. If f and g are measurable or continuous in the theorem, then the 
solutions of (17.72) are given by 


(17.77) f(x) = cote tbe g(x) = A gt tb2x 
(4 


where a, b,, b2, and c are arbitrary constants (refer to [254]). 
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Proof. Since f and g are regular, so are E, E’, and q, and E(x) = e?'!*, E(x) = 
eX and q(x) = ax~. The result follows. 
For f and g to be probability density functions, we must have 


- f@x)dx =1= [. g(x)dx. 


From e 
[ fenax =i, 
—oo 
we have 
é bi 
e«@-/r=l1., 
J/—a 
With m; = —#, o= -, f can be written as 


1 _ (x=my 2 
e Qo2 


V2 02 : 


which is a normal distribution. Similarly, g has the form 


f(x) = 


1 = (x=)? 
e 20 


g(x) = 


> 


2no2 
with mz = —=. 


We now state the result regarding (17.72). Let F be a field and F(+) a dyadic 
group (that is, the equation 2x = x + x = ahasa solution in F). Suppose f, g : 
F — F* satisfy (17.72). Then f and g are given by 


1 
fx) =ay@E@), sa@)= ZV OE), 


where y satisfies y(x + y)w(x —y) = w(x)? w(y)? (see (17.76)) and E, and E’ are 
exponential. 


(iv) Let X and Y be independent random variables with density functions f and g, 
respectively. If the random variables X + Y and X — Y are independent, then X and 
Y are normal distributions with the same variance (see [12, p. 109], [144], [600], 
[473], and Theorem 17.40). 


Proof. From Theorem 17.40, we see that 


(17.73) Aix)d2(y) = fi@t+y)faa—y), forx,y €R, 
holds. 


Proof (i). From Theorem 17.40, it follows that the general solution of (17.73) is 
given by 
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fi@) =cieMME(X), fale) = creIME'(x), 
and that 
(17.72s) f(x) =c3eI™E(x), g(x) = cgeI E' (x). 


The measurable (continuous) solutions are given by 
oy) ‘ 1 2 : 
(17.77) f(x) = cet thx , g(x) = Ret thax 
Cc 


and from Corollary 17.41, we obtain f and g as normal distributions. 


Proof (ii). Equation (17.73) is of the form (17.69) with f = ¢|, g = ¢2, h = fi 
witha =1l=b,k= fo,c=1=-—d. 

From Theorem 17.36 (see also [144]), we obtain (17.77), and thus f and g are 
normal distributions. 


17.11.3 Gamma Distribution 


The associated functional equation is 


Xx 


(17.78) fx)gy) = p+ y)¢q (=) for x, y > 0. 


This equation arises in statistics in the characterization of distributions as follows. 

Let X and Y be two nondegenerate and positive random variables, and suppose 
they are independently distributed. Then the random variables X + Y and X/Y are 
independently distributed if and only if X and Y have gamma distributions with the 
same scale parameter. 

If one assumes the existence of density functions f, g, p, and qg for the posi- 
tive random variables X, Y, X + Y, and X/Y, respectively, and that X and Y are 
independent and X + Y is independent of X/Y, then it follows that 


(17.78) f(x)g(v) = p+ y)q(x/y), x,y > 0. 


Result 17.42. (Baker [91]). The functions f, g, p, and q from R*, > R* satisfy 
(17.78) if and only if there exist real constants a, b, c, and d and functions A : R > 
Rand L,,L2: Ri — R such that ab = cd # 0, A is additive, L\, and L2 are 
logarithmic, and 


f@)= aeh@+Lir) g(x) = beA@)tLalx) | 


p(x) = ceA@+Li@+L20) and g(x) = det (te) +42(r) 


forallx > 0. 
The continuous solutions are given by 
fx) =ax%e”, g(x) = bxPe!®, 
p(x) =cx*tFe!*, and g(x) =dx*(14+x)~7. 
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17.12.1 Bose-Einstein Entropy 


Introduction 


Boltzmann’s “Vorlesungen tiber Gastheorie”, published in 1896, laid the founda- 
tion of thermodynamical entropy. However, this notion of entropy was only limited 
to physics prior to 1948. Shannon’s “A mathematical theory of communication” 
(Shannon [732]), published in 1948, is the Magna Carta of the information age. 
Shannon’s discovery of the fundamental laws of data compression and transmis- 
sion marks the birth of information theory. A key feature of Shannon’s information 
theory is the discovery that the colloquial term information can be given a mathe- 
matical meaning as a numerically measurable quantity based on probability theory. 
Shannon defined this measure of information based on a discrete probability distrib- 
ution (~1, P2,.--, Pn) as (SE) (Shannon entropy) (see Chapter 10). 

Over the last fifty years, several other information measures have been discov- 
ered. One such measure of information, known as Bose-Einstein entropy (Kannappan 
and Sahoo [515]), is given by 


n n 
(17.79) Hn (Pi, p25-++s Pn) = — >) Pe log pe + > (1 + pe) log(l + px). 
k=1 k=1 


In Kapur [524], this entropy is used for deriving the well-known Bose-Einstein dis- 
tribution in statistical mechanics. 

Today, information theory (see Cover and Thomas [177]) interacts through the 
notion of information measure (or entropy) with many scientific disciplines such as 
probability theory (e.g., Central Limit Theorem, large deviations, random processes 
and divergence, queueing theory), statistical inference (e.g., minimum description 
length, hypothesis testing, parameter estimation, density estimation, spectral esti- 
mation, Bayesian statistics, the inverse problem, pattern recognition, neural net- 
works, and speech recognition), computer science (e.g., algorithmic complexity, data 
structures, hashing, cryptology, computational complexity, and quantum computing), 
mathematics (e.g., ergodic theory and dynamical systems, combinatorics, graph the- 
ory, inequality, harmonic analysis, differential geometry, number theory, and control 
theory), physics (e.g., thermodynamics, statistical mechanics, quantum information 
theory, and chaos), economics (e.g., portfolio theory and econometrics), and biology 
(e.g., molecular biology, DNA sequencing, and sensory processing). These interac- 
tions suggest that the notion of information measure is a fundamental concept. Many 
researchers have studied this notion extensively in the last fifty years. Today there is a 
mathematical theory concerning various information measures (see [43] and [245]). 

Measures of information are one of the sources that provide a large class of func- 
tional equations for investigation (see Chapter 10). While characterizing Shannon 
entropy by the properties of symmetry and recursivity, one encounters the funda- 
mental equation of information (FED theory. The Shannon function s(x), given by 
(SFP), is a solution of this fundamental equation. 
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The Bose-Einstein entropy H, admits a sum form; that is, 


nN 
An (D1, P2, +--+ Pn) = >, F (pp) 
k=1 


where the generating function F : J > Ris given by 
(17.80) F(x) = —xlogx + 1+ x)log( +x). 


We refer to the function F as the Bose-Einstein function. Like the Shannon function, 
s(x), the Bose-Einstein function, F,, satisfies the functional equation 


y _ x 
(17.81) fa)+(+x)f (=) =fO)+0+yf (= 7 -) 
for all x, y € J. A generalization of (17.81) is the functional equation 

y _ x 
(17.81a) f@)+U+x)g (=) =h(y)+(U+y)k (=) 


for all x, y € 7. Another generalization of (17.81) is 


u D 
G(x, y)+U+x)G (=. — 
(17.81b) re ey 


x J. 
= 1 
G(u, v) + ( +06 (. .) 


forallx,y,u,o EJ. 
Now we determine the locally integrable solutions of these functional equations. 


17.12.1.1 Solution of Equations (17.81) and (17.81a) 


In this section, the letter F will be used exclusively to denote the Bose-Einstein 
function. The following theorem gives the locally integrable solution of (17.81). 


Theorem 17.43. [515]. Let f : I > R satisfy (17.81) and be locally integrable. 
Then f is given by 


(17.82) f(x) =-cF(x)+ax, forx el, 
where a and c are constants. 


Proof. First we prove that local integrability implies the differentiability of the func- 
tion f. Integrating (17.81) with respect to y from c to d ((c, d) © [0, 1]), we have 


d d d 
w-onir- [rom feonifa)or- eva (GE) 


7 1 stay sft 
=} fordy- 5 [ s“f(s)\ds —(1+x) i. f(t)dt. 
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Since f is integrable, the right side is a continuous function of x, and so, from the 
left side, f is continuous. Now using the continuity of f, the right side and thus the 
left side imply the differentiability of f. Then, again, f is twice differentiable. 

Differentiating both sides of (17.81) first with respect to x and then with respect 
to y, we obtain 


oa ()- ears (=). 


and using rt = | we get 


Setting u = ~~, v= 


= 
tad 


x 
T+y’ 


" 


me OF 


that is, 
u(l +u)f"(u) = o(1 +0) fv) =e, 
where c is a constant. From the last equation, we have the second-order differential 
equation 
c 
" = 

so 

fu) = cllogu — log(1 +.u)] +a. 


Solving the first-order differential equation above, we obtain 
ft) =c[ulogu — (1+ u)log +u)]+au+b. 


Setting y = 0 in (17.81), we see that f (0) = 0, so b = 0. This proves the theorem. 


Now we give the solution of the functional equation (17.81a), which will be used 
to solve (17.81b). 


Theorem 17.44. [515]. Suppose f, g,h,k : I > R satisfy (17.81a) and one of them 
is locally integrable. Then 


f(x) = -cF (x) + (a2 — b1)x +53, 
gy) = —cF(y) + ary +b, 
h(u) = —cF(u) + (ay — b2)u + ba, 
k(v) = —cF(v) + anv + bo, 


(17.82a) 


with b, + b3 = bo + ba, where a, and az are arbitrary constants. 


Proof. It is easy to see that if any one of the functions f, g, h, k is locally integrable, 
so are the others. We also have, by letting y = 0 and x = 0 respectively in (17.8 1a), 


(17.83) f(x) + (1 + x)g(0) = AO) + k(x), 
(17.83a) FO) + 8) =hY) + + y)k(0). 
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As in Theorem 17.43, it can be shown that f and hence g, h, and k are twice 
differentiable. Differentiating first with respect to x and then by y, we get 


y e"( y )- x K( x ) 
(+x)? 1+x (d+y)? I+y}- 


. _ y ae x 
As before, taking u = yz, 0 = Try we have 


u(1 + ug” (u) = vo(1 + v)k"(v) =e, 
where c is a constant. Thus 
g(u) = —cF(u)+aqut+h, kv) =—-cF(v)+a20 + bo. 


Now we use (17.83) and (17.83a) to get the asserted solution. This completes the 
proof. 


17.12.1.2 Solution of Equation (17.81b) 


In this section, we prove the following theorem. 


Theorem 17.45. [515]. Let G : I x I > R satisfy (17.81b) and G be locally 
integrable in its first variable. Then G has the form 


(17.82b) G(x, y)=—-cF(x)+xLQy)—-Ud+x)L0 4+ y)+ax, 


where a and c are arbitrary constants and L is logarithmic (L) with 0log 0 = 0. 


Proof. If we temporarily fixed y, v in (17.81b), then (17.81b) becomes (17.81a) with 
f@) = G(x, y), g4%) =G (x, rh): etc. Since G is locally integrable in its first 
variable, applying Theorem 17.44 we get 


G(x, y) = —c(y, v) F(x) + a(y, v)x + bly, v), 


where a, b, and c are functions of y and v. Since the left side is a function of y only, 
a, b, and c are functions of y only, say a(y), b(y), and c(y). Since c occurs in g, h, 
and k, cis aconstant. Thus 


(17.84) G(x, y) = —cF(x)+a(y)x + b(y). 


To determine a(y) and b(y), we substitute the form of G in (17.84) into (17.81b) 
and obtain 


D D 
a(y)x + b(y) + ua (=) +(1+x)b (—) 


=a(v)u+b(v)+xa (=) +0+0(). 
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Equating the coefficients of x and the constant terms, we have 


y 
(17.85) ay) +0( 72.) = «(). 


17.85 b b{ ——]=b) b 
(17.852) (y) + ()- (0) + (=). 
To find a(y) and b(y) (cf. [450]), we subtract (17.85a) from (17.85) to get 


— Bg — 
(17.85b) a(y)=a (= a5 -) bv), 


where a(y) = a(y) — b(y). Letting y = 1 + 0, we obtain a(1 + v) = a(1) — Dv). 
Next, setting +4, = ¢ in (17.85b), we get 


a(t(1+v)) =a(t) +a +0) —a(); 


that is, 
L(xy) = L(~) + L(y), 
where 
(17.85c) L(x) =a(x)-—a(Q1) =a(x) +c (say). 


From (17.85b) and (17.85c), we see that 


Liy)-ca =L (=) —c;—b(v) 
or 
(17.86) b(v) =—-L(+ vd). 
Using this and (17.85), we have 


l+y+o) _ y 
ay —L( T+y )=(4) 


Vio 
Next, we set To =t. Then 


y y _ = 1\\ | 
aoy- L(+) =e (a (14; )) =a 
or 


Fo) xa (ee 
(17.86a) ay) -L (2) =a L(7) =a. 


where a is a constant. Now (17.84), (17.86), and (17.86a) yield the asserted solution 
(17.82b), and the proof of the theorem is complete. 


The general solutions of these equations without any regularity assumptions will 
appear elsewhere. 
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17.12.2 Sums of Powers 


A method is introduced to find the sum of powers on arithmetic progressions by using 
Cauchy’s equation and obtain a general formula. Then we apply these results to show 
how to determine some other sums of powers and sums of products. These results 
are more general. Finally, we discuss the sum of powers on arithmetic progressions 
in commutative rings with characteristic 2 and find “full polynomials”. 

Since the time of James Bernoulli (1655-1705) [122], finding the sum of powers 
of integers 


(17.87) Sen) = 1 +2 4... +n*, kEZy, neZt, 


has been fascinating mathematicians (see Chapter 1, Section 1.8 application), and 
several methods have been developed to find the sum. One of the methods to find the 
sum in (17.87) uses the theory of functional equations. In particular, in Kannappan 
and Zhang [523], the sum S;(m) can be determined by applying the known recur- 
rent formula (Snow [758], Au- Yeung [836]), grouping terms of the same power and 
reducing them to Cauchy’s additive equation. This method is also applicable in com- 
binatorics and genetics [523]. Inspired by [55], we introduce this method to find the 
sum of powers on arithmetic progressions 


(17.88) Sx(n;a,h) =a + (ath +---+[a+Q—lnlk, keZy, n eZ, 


where a and / are given real constants, and obtain a general formula for S;(”; a, h). 
Applying our results, we determine some other sums of powers and sums of products 
such as 


k-1 k-1 k-1 
(17.89) Pr(nsa,h) =] [a+ s+] [@atjitnt+---+]]@+i+@- vn). 
j=0 7=0 j=0 


Results in [523] will be generalized here. Finally we discuss the sum of powers on 
arithmetic progressions in commutative rings, especially in commutative rings with 
characteristic 2, and find full polynomials. 


17.12.2.1 A General Result 
Theorem 17.46. [523]. For each integer k > 0, 
(17.88a) Sk(n; a, h) = n(uren® + ween! + we ean? ++ + Mk N+ MK): 


where k,j = Lk,j(a,h), j = 0,1,2,...,k, are (homogeneous) polynomials of 
degree k ina and h and are given by 


a.hy= nk 
Ltk,k (a, h) ia 


(17.88b) a re ee 
Mk,k-i (a, h) = oj) a= > 3 ; 
j=l 


fort = No 2 yan ks 
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Proof. The proof of this theorem is based on induction. First, for k = 1, we easily see 
that (17.88a) holds for all natural numbers n (see [836]). Assume that, for all natural 
numbers k < f, the formula (17.88a) holds for all natural numbers n. We prove that 
(17.88a) holds for k = ¢ + 1 and all natural numbers n. For n = 1, andt+ 1 fork in 
the right-hand side of (17.88a), 


(17.90) treaties et Metal te Fert Ft er4+1,0 = git} = S411; a, h) 


because 


t+1 
Ht+10=4 = Ht+1,1 cm Mette — Mt+1,t+1 


by (17.88b). Furthermore, suppose that (17.88a) holds for k = ¢ + 1 and n; that is, 
S;41(n; a, h) holds for all natural numbers n < m, and then consider the next case 
m + 1. By (**) in Chapter 1, we have 


t+1 
t+1 . 
SresCm + Leah) ~ Seems a.) = alt + >” ( H Jeena, 


i 
i=1 
By induction hypotheses for k and for n, 


t+1 
t+1 ; 
Sp4i(m + 1; a,h) = Sy41(m; a,h)+ > ( 7 ) Sera a,h)h' + qit! 
i=l 


t+1 
t+2-—j 
= >) wii z 
j=0 
t+1 PE t+1-i 
t+2-—i-jpi t+1 
+>( i ) D> Bett—aepiinjn tat + al 
i=1 j=0 
t+1 i 
t+1 . 7 
(17.91) = a | ) oes jaetci mete git 
=o \ieo ¥ 


written as a polynomial in m. Observe that 


t+ 
D> eet et1-jOn + PS 
j=0 
t+1 t+2—j oe 
= > meee; Y ( ; mts 
=O i=0 
ttl [i aoe t+1 
t+2-i+] = 
(7.9la) =>) >( j ) erste mS wert j- 
i=0 \j=0 j=0 


By comparing coefficients of powers of m between (17.91) and (17.9 1a), it suffices 
to prove that, fori = 0,1,...,¢+4+ 1, 
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i 


a ee ee ‘(t+ ; 
(17.92) >( Virrswriins =D ( j )oestjaetaatl 


j=0 J j=0 
t+1 

(17.92a) wrt iepi-j =a’ 
j=0 


Actually, (17.92a) is (17.88b). Equality (17.92) also holds because, for each j = 
0; Ts cats 


Ce 


j ) esas 


a, 1 i 
7 j Pl=1+ j= J 


ix j 
aT —D ni-je 
f=] 


(+0)! — 
7 _ Ne ij oe pier 
ji@—pie+2—H)! \* DH mye 
oe A geeiieg (j-i_¥ 
-(% (4 oe aI — Yi mimic | hd 
j Jt+t2-i r— J f=1 . 


t+1 : 
= j Mr+1—jt+1—i 


by (17.88a). This proves that (17.88a) holds for k = ¢t + 1 and all n, and this 
completes the proof. 


Aczél [21] obtained the general sum as 


1 SO fk+1 
_ sa 
(17.88c) Si (n; a, h) = a) 2 (; i i) ce_jhi, neZi, 
where constants ce safisfy 
Safe j-l c-1 
> _jce-jhe =ta, =F =1,2,.... 
ja 


These formulas are the same as (17.88a) with (17.88b). For example, we identify the 
cases for k = 1, 2. For k = 1, from (17.88c) we get 


1 
co=l, cy =a—h/2, and S\(n;a,h)= gln@a —h) +n7hl, 
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while from (17.88a) and (17.88b) we have 


an 5a, : 
H1,1 1/2, M1o=a > 1; a,n n|\{a 5 5” 


Both results are the same. For k = 2, from (17.88c) we get 


a4 _ h 2 h ; h? 
co=l1l, cy=a > 3a a 5 h 3° 


and 


1 / h? / 
So(n;a,h) = 3 [>»(« - (« - *) h- =) eae (« - *) +n] : 


On the other hand, from (17.88a) and (17.88b), we have 


_ We =f h eg h h? 
12.2 = 3° L211 = a a)? H20=a 11a 5) 37 


Sea h2 ae h f Ca h h? 
a(n;a,h)=n a7 a 5 n a a 5 5 : 


We also get the same results. 
Now we find the sum 


and 


k k k 
n(n) = 1k 45k 4...4 (4n—3)', k=1,2,3, ne Z*, 


given in [472] by our formulas. 
We can consider sums of powers of integers with positive and negative terms, 
too. For example, consider the sum 


se(n, -1) = -1F + 2% 4-.-4(-1)"n®, ke Zs, ne Zt. 
It is easy to see that 
n n 
se(n, —1) = Sx (5; 2, 2) — Sx (5; 1,2) 


for n even and that 


(n,-l1l)=S : 10,2 -5§ n= 1,2 
SE(n = S, {| ——; 0, .{ ——; 1, 
kW, k 5) k 5) 


for n odd. In particular, s2(2n,—1) = S2(n; 2,2) — S2(n; 1,2) = 2n* +n and 
s2(2n + 1,-1) = S2(n; 0,2) — S2(n; 1,2) = —2n? +n. As a second example, 
consider 
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Pa 8 ee Ch 2 = Gea IP + Oe) 
= —S,(n; 1,3) — So(n; 2,3) + S2(n; 3, 3) 


9 5 
=. 25,9 a. 2 a * 
= —3n ey Tin forn eZ). 


Some Generalizations 


Now we consider the sum S;(n; a, 1) on a divisible commutative ring (IT, +, -) (that 
is, witha,h € T andk € Z,, n € Z*) and generalize our results. Here “divisible” 
is meant in its traditional sense that for all n ¢ Zi and b € I there existsac € I 
such that b = nc (no uniqueness required). Obviously, all basic results discussed 
above are valid on such a ring. 

Furthermore, if we consider the sum S;(n; a, h) ona divisible commutative ring 
(T, +, -) of characteristic m € Z% (that is, mb = 0 for all b € I,) then we observe 
some interesting phenomena. 


Theorem 17.47. [523]. Fora, h in a divisible commutative ring (1, +, +) of char- 
acteristic m € Zi, 


m—1 k 


BY DVEGat thi ifn = fm, BeZ,, 
j=0 i=1 
ya +(B+1) ifn = Pm+y, 
: _ y-l k — 
(17.93) Se(n; a, h) = Os (jak ini BeZ,, 
j=0i=1 
m—-1 k a 
+2>> &(jak*h', y =1,...,m—1, 
j=y i=l 


where &(j) = (*)j' (mod m) € {0,1,...,m— 1}. 


Proof. Let n = {m+ y, where f € Z; andy = 0,1,...,m — 1. Since the ring 
(T, +, -) is of characteristic m, we see mh = 0. Notice that 


k 
(at jnk sak +> eG)ah', O< j<m-1, 
i=l 


where é(/) is defined in the theorem. It follows from (17.88) that, when y = 0, 


n—1 m—1 
Ske(n;a,h) = Si@ + jh) =p DC + jhyk 
j=0 j=0 
m—-1 k 


=na* +B >) > &(at in’, 


j=0 i=! 
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and when y 4 0, 


n—-1 y-l m—1 
Se(n;a,h) = Do(at jh)* =(B+1) >t jay +B > G@+ jn) 
j=0 j=? 
y-l1 k m—-l1 k 


=na*+(8+1) >) > 4G)a* ‘a +8 > > aG)at th. 


j=0 7=1 j=y i=1 


j=0 


This proves the theorem. 


More concretely, with the characteristic m = 2, by Theorem 17.47 we see 


Scena Sh? (even n), 
, a> +h? (oddn), 
re $[a*h + ah? +h] (even n), 
3m; a,h)= J", nap 9 24. 93 
a’ + -la“h +ah*+h°| (odd n), 


and so on. As is known from the proof of Theorem 17.47, the scale of representations 
(17.93) is clearly decided by the expansion of (a + jh)*. The expression (17.93) is 
said to be full if all coefficients of powers of a and h are nonzero. For m = 2, 
expanding (a + jh)* patiently up to k = 15, we see that the full expression occurs 
fork = 3,7, 15; ie., 


(athp=a+a*ht+ah? +h’, 

(a+ hy! =a’ +a°h+a>h? + ath? +a°h* + ah? +. ah® oh. 

(a ey) a — a> +al*h + ap? +a'*n3 tal'yt + a!p? + a?h® + adh? 
+7 +O on” ath 40h? tah? tan sn. 


We refer to such polynomials as full polynomials. It will be interesting to find k’s for 
which we have a full polynomial of degree k. Trying it with Maple V5.1 software, 
we found that the fourth time for all terms to occur or for full polynomials to occur is 
when k = 31, and the fifth time is when k = 63, where the lists of integer coefficients 
are given respectively in the following: 


[31, 465, 4495, 31465, 169911, 736281, 2629575, 
7888725, 20160075, 44352165, 84672315, 141120525, 
206253075, 265182525, 300540195, 300540195, 
265182525, 206253075, 141120525, 84672315, 44352165, 
20160075, 7888725, 2629575, 736281, 169911, 

31465, 4495, 465, 31] 
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and 


[63, 1953, 39711, 595665, 7028847, 67945521, 
553270671, 3872894697, 23667689815, 127805525001, 
61590256823, 2668424446233, 10468434365991, 
37387265592825, 122131734269895, 366395202809685, 
1012974972473835, 2588713818544245, 6131164307078475, 
13488561475572645, 27619435402363035, 
52728013040874885, 93993414551124795, 
156655690918541325, 244382877832924467, 
357174975294274221, 489462003181042451, 
629308289804197437, 759510004936100355, 
860778005594247069, 916312070471295267, 
916312070471295267, 860778005594247069, 
759510004936100355, 629308289804197437, 
489462003181042451, 357174975294274221, 
244382877832924467, 156655690918541325, 
93993414551124795, 52728013040874885, 
27619435402363035, 13488561475572645, 
6131164307078475, 2588713818544245, 
1012974972473835, 366395202809685, 
122131734269895, 37387265592825, 

10468434365991, 2668424446233, 615790256823, 
127805525001, 23667689815, 3872894697, 

553270671, 67945521, 7028847, 595665, 

39711, 1953; 631]. 


Trying some more for k = 127, 255,..., we observe that full polynomials ap- 
pear regularly fork = kj, i = 1,2,..., where kj = 3 and kj4, = 2k; + 1. For 
convenience, we say that those integers are totally odd. Obviously, for k = ko := 1, 
the corresponding polynomial is also full. 


Result 17.48. [523]. Fora, h in a divisible commutative ring (T', +, -) of character- 
istic 2, the sum Sx(n; a, h) is a full polynomial in a and h if, and only if, k is totally 
odd. 


Remark. It will be interesting to see how the characteristic m affects the sum 
Si(n; a, h) when m 4 2. 


17.13 Behavioural Sciences 


Functional equations are useful in the experimental sciences because they offer a 
tool for narrowing the possible models for a phenomenon. A model can be for- 
mulated by one or more not very restrictive equations, which when paired with an 
empirical or logical constraint of a general character lead—via functional equation 
techniques—to precise quantitative relationships. The article by Aczél, Falmagne, 
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and Luce [47] reviews various applications of functional equations in some areas of 
the behavioural sciences such as sensory psychology (psychophysics), utility theory 
under uncertainty, and aggregation of inputs and outputs in an economic or social 
context. 

The purpose of this section is to describe some of the uses of functional equa- 
tions in the behavioural and social sciences. Applications of functional equations in 
three empirical fields are discussed: examples in sensory psychology (in particular, 
psychophysics) resembling the Plateau example but more complex and covering dif- 
ferent situations; individual decision making under uncertainty (utility theory); and 
aggregation of inputs and outputs in a social or economic context. We start with a 
summary of some classic types of functional equations, solved long ago (for details, 
see [32]), that have proved useful in the analysis of behavioural models from a func- 
tional equation viewpoint. In all cases that we consider, the functional equations are 
numerical ones, generated by a mathematical model or a numerical representation of 
the phenomenon under consideration, in which the modelling is incomplete and at 
least some of the functions are unspecified. A substantial body of literature on rep- 
resentational measurement theory describes classes of qualitative systems that give 
rise to numerical representations. 


17.13.1 A Behavioural Example 


Functional equations arise in the sciences because they allow researchers to formu- 
late mathematical models in general terms, via some not very restrictive equations 
that only stipulate basic properties of functions appearing in these equations, without 
postulating the exact forms of such functions. However, the data or the experimental 
situation itself may exhibit regularities or symmetries that can be captured by some 
other equation(s) involving the same functions, thereby constraining their forms and 
specifying the model. 

Functional equations enter the picture in at least three distinct ways. One occurs 
when some invariance property holds. An example of such an invariance is some 
type of homogeneity of one of the functions. Other examples of invariance involve 
utility and weighting functions from utility theory. 

The second example arises when two independent measurement schemes give 
rise to distinct measures of the same underlying attribute. An example in physics is 
mass. It can be measured by concatenating objects on the pans of a pan balance (in 
a vacuum) that is used to determine the order of greater mass. The resulting mass 
measure is additive over concatenation. It can also be measured fundamentally by 
varying the container size and the choice of homogeneous substances within them on 
a pan balance. This measure of mass exhibits a product structure with corresponding 
measures of volume and density. Clearly, because both mass measures represent 
the same ordering, they are related by a strictly increasing function. Any additional 
empirical law linking the two measurement structures imposes a constraint on that 
function in the sense that it must satisfy a functional equation. In the physical case, 
the link is a qualitative distribution law. 
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The third example concerns what may be thought of as consistency principles. 
One such principle is the commutative property used in modelling aggregation. Sup- 
pose we have variables x;; that can be aggregated over either i or j and, once done, 
over the remaining subscript. An economic example concerns several types of inputs, 
such as raw materials, energy, and labour, to some production. Often one wishes to 
speak of an aggregated result over each type of input and over the outputs. The 
question is under what circumstances the order of aggregation does not matter. 

Let m(x, y) be the luminance of a disk appearing midway between the disks x 
and y. Plateau’s data can then be formalized by the homogeneity equation 


m(Ax, Ay) =Am(x,y), 4,x,y > 0, 


where the value of 4 reflects the differing conditions of illumination. (Realistically, 
the domain of m should be restricted to a suitable positive region near the origin). 
There is some sensory scale u mapping the lux scale into the reals, such that 


ulm(x, yy] = MO HO) 


A function m satisfying this equation for some continuous strictly monotonic func- 
tion uw is called a quasiarithmetic mean (see Chapters 15 and 17). It is assumed that 
the scale wu is defined on R*,. The equation above leads to the functional equation 


=4 | fat je 
u ef] HM a 


17.94 
(17.94) 5 5 


|: A,x,y > 0, 


which has only two families of solutions for the function wu. In addition to Cauchy, 
Pexider, and Jensen equations, u satisfies their generalizations, such as 


u(Ax) =m(A)ju(x) +n(A), Ax € Ri, 
which is a special case of (1.94b) and (1.94a). Here we need only u(x). From Re- 
sult 1.61, we have either 
aL(xy) + 2b 


u(x) =aL(x) +b or u(m(x,y))= 5 ; 


where L satisfies (L). Since u is continuous, we have 
1 
u(x) =alogx +b, m(x,y) = (ay)?, 
or 


u(x) = c(M(x) — 1) +), u(m(x, y)) = eM se ME ea 2B 


2 
where M is multiplicative (M); that is, since u is continuous, 
B+ yB\P 
u(x) = cx? +d, m(x,y) = (=) 


where a,c, 6 4 0 and b and ¢ are constants. For strictly increasing solutions, a > 
0, c,h > 0. 
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17.13.2 Psychophysics 


Many applications of functional equations in psychophysics are in the style illus- 
trated by Plateau’s situation discussed earlier. On the experimental side, one starts 
with some homogeneity equation that seems to capture an important regularity in 
the data. In the case of Plateau, the function m is homogeneous of degree 1. Next, 
one introduces some reasonable model, involving one or more unknown functions, 
formalizing a possible mechanism for explaining the behaviour of the individuals in 
the study. 

The cases reviewed here all involve a binary discrimination situation in which an 
individual must decide which of the two stimuli presented has more of some sensory 
attribute. For instance, the individual compares the loudness of pure tones differing 
only in their intensities, which is measured in standard ratio scale units (such as 
sound pressure level). 

In a classic nineteenth century case involving E.E. Weber and G.T. Fechner, the 
individual is presented with a pair of stimuli (x, y) and the researcher’s task is to es- 
timate experimentally the probability P(x, y) that a stimulus of intensity x is judged 
as louder (or brighter, longer, etc.) than a stimulus of intensity y. In mathematical 
terminology, the so-called Weber Law states that the function P : Ri — 10, I[ is 
homogeneous of degree 0: 


P(Ax, Ay) = P(x, y), x,y,4> 0. 


Fechner’s contribution consisted of proposing that the individual’s judgment re- 
sulted from a computation of the differences between the two sensations produced by 
x and y, this difference being evaluated in terms of some unknown sensation scale. 
Formally, Fechner’s idea is expressed by the equation 


P(x, y) = Fu) —u)), 


where the functions u and F are real-valued, strictly increasing, and continuous, but 
otherwise not specified a priori. In the context of Weber’s Law, the possible forms 
of u and F above are severely constrained. With the two equations above, we get the 
functional equation 


FUu(ax) —u(y)) = Fu) —uQy)), *>0, y>0, 24> 0, 
which, since F is strictly increasing, is equivalent to 
u(Ax) —u(dy) =u(x)-—u(y), x >0, y>0, A> 0; 


that is, 
u(Ax) — u(x) =u(dy) — u(y) = dA), - say, 


or 
u(Ax) = u(x) + bi), 
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which is logarithmic Pexider. So 


Xx 


u(x) =aL(x)+b and Persy) + F (at ( )). x,yeRi, 
y 


where L satisfies (L). 
Since u is continuous, 


u(x) =alogx+b and F(x)=P (c?, 1). 


17.13.2.1 The Conjoint Weber Law 


Suppose that an individual wearing earphones is presented with a 1000 Hz tone de- 
livered simultaneously, with different intensities, to the two auditory channels. The 
impression created by such a stimulus is that of a single sensation, the loudness of 
which depends on the combination of two inputs. (The location of the sensation in- 
side the individual’s head also depends on the combination of the two intensities, but 
this aspect of the phenomenon is not relevant here.) We write (a, x) for such a two- 
dimensional stimulus, where a and x stand for positive real numbers denoting the 
sound pressures in the left and right auditory channels, respectively. Extending our 
earlier notation, we write P(a, x; b, y) for the probability that the individual judges 
the two-dimensional stimulus (a, x) to be louder than the two-dimensional stimulus 
(b, y). Various concepts concerning the function P are gathered in the definitions 
below. 


Definitions 


We suppose that P : R4 — ]0, I[ is continuous, with P(a, x; b, y) strictly increas- 
ing in a and strictly decreasing in b, and strictly contramonotonic in x and y (that 
is, P(a, x; b, y) is either strictly increasing in x and strictly decreasing in y or vice 
versa). We will call such a function a discrimination probability. Note that the hy- 
pothesis that P is strictly comonotonic in the second and fourth variables is weaker 
than what is suggested by the binaural loudness summation example but makes sense 
because the formalism is then also applicable to other empirical cases. For instance, 
a pair (a, x) could represent a stimulus of intensity a presented over a “noisy” back- 
ground of intensity x. 

A discrimination probability P satisfies the Conjoint Weber Law if it is homoge- 
neous of degree zero; that is, 


(17.95) P(ia, Ax; Ab, Ay) = Pla,x;b,y), 4,a,x,b,y > 0. 


It is easily shown that (17.95) holds if and only if there is a function T : R3 —> Ip 
such that 


(17.95a) P(a,x;b, y) =T(a/x,x/y,b/y), 


with T strictly increasing in the first variable and strictly decreasing in the third 
variable. 
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Result 17.49. [47]. Suppose that P : R4 — Ip is a simply scalable discrimina- 
tion probability satisfying the Conjoint Weber Law (17.95). Then, one of the two 
following possibilities holds: 


(a) There are some real-valued, continuous, strictly monotonic functions T; > 0 and 
c satisfying 

T) (a/x)x 

T(b/y)y 


(b) There is a continuous function Qo, strictly increasing in the first argument and 
strictly decreasing in the second argument, such that 


P(a,x;b,y)= Qo(a/x,b/y). 


Result 17.50. [47]. Suppose that P : R4 — Io is a discrimination probability 
satisfying the Conjoint Weber Law (17.95), together with the component additivity 
equation 


Pla.sidsy) =e ). a,x,b,y > 0. 


Pla,x;b,y)=Hf@+rQ), fO)+rQ)), 4,x,b,y > 0, 


where f, r, and H are continuous functions, with f strictly increasing, r strictly 
monotonic, and H strictly increasing in the first variable and strictly decreasing in 
the second variable. Then one of the three equations 


a? + bxF 
bP + dyh J? 


Ce me ed 
(a, x; y= bPyy > 


a b 
P(a,x;b,y)= Qo (.°). 
x y 


Pla,xib,y)= G( 


must hold for a,x,b, y > 0, where G is continuous and strictly increasing, and 
B > 0, 640, andy 40 are constants and Qo is continuous, strictly increasing in 
its first argument, and strictly decreasing in its second argument. 


Thus the function P is necessarily decreasing in its second argument and increas- 
ing in its fourth argument. 
For more details on the applications above and utility theory, see [47]. 


17.13.3 Binocular Vision 


A general theory of the relationship of binocular visual space to physical space is 
formulated within a conjoint measurement framework. Psychophysically motivated 
invariance relations induce various functional equations. Another class of functional 
equations arises if we additionally assume the validity of a different psychophysical 
theory that is based on a formula suggested by A.A. Blank. We interpret most of 
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these equations. The results obtained lead to a measurement-theoretic foundation 
of the psychophysical assumptions underlying the Luneburg theory of binocular vi- 
sion. They also contribute to clarifying the relationship between the general theory 
presented and Blank’s approach. 

We perceive that we are located in a unitary and stable perceptual space. The 
so-called visual space is distinguished from other perceptual spaces by its com- 
pelling phenomenological reality and its undoubted spatial character. In a highly 
sophisticated approach, Luneburg [617] embodied visual space in a geometrically 
formulated perceptual theory. The Luneburg theory of binocular vision not only 
provides a geometrical characterization of the internal structure of visual space but 
also describes its dependence on physical space. It accounts for classical empirical 
phenomena, such as the Helmholtz frontal horopter (Helmholtz) or the visual alleys 
(Hillebrand, Blumenfeld) that arise under viewing conditions that isolate the effect 
of binocularity by excluding monocular cues and experiential factors. These condi- 
tions are usually realized by presenting the observer with point-like light sources of 
low intensity in complete darkness. The position of the head is kept fixed, while the 
eyes are allowed to move freely. The main achievement of Luneburg was to relate 
the above-mentioned phenomena to a non-Euclidean geometrical structure of visual 
space. 


17.13.3.1 The Luneburg Theory of Binocular Vision 


The following two postulates are at the core of the Luneburg theory of binocular 
vision: 


(L1) Visual space, which may be coordinatized with respect to perceived egocentric 
distance and perceived direction, is a Riemannian space of constant Gaussian 
curvature. Its metric determines the perceived interpoint distances, while its 
geodesics are the perceptually straight lines. 

(L2) The points lying on the Vieth-Miiller circle are perceived as being equidistant 
from the observer, and the hyperbolas of Hillebrand are perceived as radial 
lines of constant direction. 


By assumption (L2), the preimages of the visual coordinate lines under the psy- 
chophysical function are the Vieth-Miiller circles and the hyperbolas of Hillebrand, 
respectively. This psychophysical relationship is not most elegantly formulated by 
the Cartesian coordinates of the physical space, where the rotation centres of the eyes 
are located symmetric to the origin on the y-axis and the stimuli are the points lying 
in the horizontal positive half-plane x > 0. Instead, introduced in [617] were bipolar 
coordinates a, £, which are angular coordinates describing the monocular directions 
relative to the rotation centres of the eyes. 

The most often used modified bipolar coordinates are then given by the linear 
combinations 


at+f 
o 


(17.96) y=a-—-fp and g= 
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The bipolar parallax y is thus the difference of the monocular directions, and the 
bipolar latitude ¢ is their average. 

The reason for referring to the modified bipolar coordinates becomes obvious if 
we notice that the trajectories y = constant are the Vieth-Miiller circles, while the 
trajectories ¢ = constant are the hyperbolas of Hillebrand. Luneburg’s postulate 
(L2) is then equivalent to the statement that the perceived egocentric distance only 
depends on y and the perceived direction only on ¢. 

There is an impressive body of experimental data that tests the empirical ade- 
quacy of the Luneburg theory. The theoretical predictions for individual observers 
were found to be qualitatively in accord with the data. The exact quantitative pre- 
dictions, however, are often not within experimental error. In looking for possible 
reasons that might be responsible for this situation, we want to focus on postulate 
(L2) without regard to the assumption concerning the geometrical structure of visual 
space as formulated in (L1). 


17.13.3.2 A Conjoint Representation Generalizing the Luneburg Theory 


Let S = {(a, 8) € V7 | a > f} be the set of stimuli with V = ] — 2/2, x/2[, where 
a, f are bipolar coordinates. 

We start by characterizing the loci of perceived equidistance and constant per- 
ceived direction, respectively, as equivalence classes that are induced by order- 
ing relations. Let us introduce binary relations <x), <g on S that describe an 
observer’s qualitative order judgments with respect to the perceptions considered. 
By (a, ) <p (a’, B’) we express that the perceived egocentric distance of the stim- 
ulus (a, 8) is judged not to exceed that of (a’, 8’). Similarly, (a, 8) <a (a’, B’) 
means that (a, /) is not perceived as lying to the right of (a’, #’). The relation <, 
thus orders the stimuli from near to far, and <g from left to right. Let ~,, ~g and 
<p, <o denote the symmetric and asymmetric parts of <p, <g, respectively. Note 
that these orderings are assumed to remain invariant under fixation changes. 

Using equation (17.96), we may now reformulate Luneburg’s psychophysical 
assumptions concerning the relations <x, and <g in the format of representational 
measurement theory. For all (a, £), (a’, B’) € S, 


(a, 8) Xp (a, B') iffa-B>a'— fp’, 


(17.96a) , oo 0 
(a, Pf) xo (@’, f’) iffa +f >a't pf. 


As an immediate consequence of (17.96a), we observe that 


(a, B) Xp (a’, B’) iff (a, i) Xo (a’, B) 


holds, whenever (a, f), (a’, f’), (a, 6’), (a’, B) € S. We call binary relations <p, 
<g dual if they have the property above. Interestingly, the duality condition is related 
to the ambiguity problem in stereopsis, which refers to the identity of the stereograms 
representing the points (a, f), (a’, B’) or (a, f’), (a’, B), respectively. 

Our generalization is obtained in a straightforward manner by assuming that an 
observer’s judgments are not based on a direct comparison of the respective bipolar 
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coordinates as in (17.96a) but by comparing some transformation or evaluation of 
these coordinates. In other words, we assume that there exist strictly increasing 
functions f, g, mapping V into real intervals (which are necessarily of positive 
length), such that for all (a, B), (a’, B') € S 


(17.96b) (a, B) Xp (a’, B') iff f(a) — g(B) = Fa’) — 8(8’), 
(17.96c) (a, B) xo (a’, B') iff f(a) + g(B) = f(a’) + 8B’). 


Heller [363] provides a set of sufficient conditions that warrants a representa- 
tion according to (17.96b) and (17.96c) with strictly increasing functions f, g. The 
empirical relational structure (S,<,, <g) satisfying these prerequisites is called a 
binocular coordinate structure. Available empirical evidence supports the proposed 
representation (see [363]). Since for the following functions f, g have to map J 
onto real intervals, we will use the term binocular coordinate structure only if this 
additional requirement is met by a relational structure (S, <,, <9). This continuity 
assumption does not pose problems in the present application. 

The approach presented amounts to a psychologically significant recoordinatiza- 
tion of physical space. Generalizing (17.96), we may define 


f(a) + 8(B) 


(17.96d) T= f(a)—g(f) and O= 5 


The coordinates [, ® of the horizontal plane at eye level are satisfying some of the 
fundamental properties that Luneburg originally intended when he introduced the 
modified bipolar coordinates y, ¢. 


17.13.4 Functional Equations Resulting from Psychophysical Invariances 


A psychophysical invariance is a physical transformation of the stimuli that leaves 
perception invariant. The psychophysical invariance can be translated into a func- 
tional equation that has to be satisfied by the psychophysical function and thereby 
restricts its possible form (cf. Falmagne [268]). In the present case, we may derive 
restrictions for the functions f and g appearing in the representations (17.96b) and 
(17.96c). 

The Luneburg theory as expressed in (17.96a) suggests the following definition 
of psychophysical invariances. 


Definition 17.51. A binary relation < on S is called 
y -shift invariant if 
(,f) = @',f) iff(@+t,8—1) x (@' +7, P —7), 
d-shift invariant if 


(a, B) < (a’, B’) iff(at+t,B +7) < (a +7, f’ +7), 
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a-shift invariant if 


(a, 2) = (@',f) iff(@t+r, £) =< @' +17, 6), 


and 
f-shift invariant if 


(a, 8) x(a’, B) iff (a,B +7) x @',f’ +7), 


whenever the pairs involved are in S. 


The significance of the transformations in Definition 17.51 (see the figure below) 
is due to the fact that they keep invariant the parallax differences between stimuli, 
which are commonly identified with the disparities inducing binocular depth percep- 
tion. 

In what follows, we show how functional equations arise from assuming that the 
transformations introduced above constitute automorphisms of the binocular coordi- 
nate structure (S,<,, <q). 


Graphical illustration of a /-shift as introduced in Definition 17.51 applied to three collinear 
points. Keeping their a-coordinates (solid lines emerging from R) fixed, the /-coordinates are 
transformed by adding an (in this case negative) amount 7, which results in a rotation around 
L that moves the fan of dashed lines into the fan of dotted lines. 


By the representation formulated in (17.96b), y -shift invariance of x, translates 
into 


f@)-g6)= f@)-s) iff f@+rt)-—g@—rt) = f@'+r1)—gG'—-2). 
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The real-valued function H given by 


(i) Al f(a) — (8), t] = flat+t)— g(b —7) 


is thus well defined and strictly increasing in its first variable; i.e., H(z, 7) strictly 
increases in z. (It is also strictly increasing in 7, but that is incidental.) The functional 
equation (i) provides a complete characterization of the y -shift invariance of <, in 
the context of the proposed conjoint representation. The table below summarizes the 
functional equations obtained similarly in each case considered. The functions H in 
these eight equations are in general different. 


TABLE 


Functional Equations Induced by the Invariance Properties Introduced in 
Definition 17.51 


Invariance Applied to Equation No. 


y -shift <p Alf@)—s(f),t1]=fla+r)—-gB—t) @ 
xo = AL f(a) + (8), tr] = flat+rt)+86-7) @) 


o-shift <p AHAlf@)—s),t1]=f@+r)—gBt+r) fi) 
xo = A f(a) + 88), t= fat) +8G+r) Gi) 


a-shift Xp A[f(a) — g8(f),t] = f(a+t)—g(f) (iii) 
Xo AL f(a) + 9(f),t]= flat+r)+9(f) Gi’) 


B-shift Xp Al f(a) — g(B),t1= fla)—g(B+r) (iv) 
Xa A[f(a) + g(f),t] = fla)+gB+r) (iv) 


Solutions of the Functional Equations (i)—(iv) and (i’/)—(iv’) 


Result 17.52. (Aczél, Boros, Heller, and Ng [39]). The general strictly increasing 
solutions f, g of (i) or (i) for all (a, B), (a +y, B —¢) € S, which map V onto real 
intervals, are given by 


fa)=artb, g)=P+d, a,c>0, 
and by 
f(a) = ae +b, g(B) = —ceP +d, a,c,C > Oora,c,C <0, 


where a, B € V. Except for the sign restrictions, a, b, c, d, and C are arbitrary 
constants. 
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Result 17.53. [39]. The general strictly increasing solutions f, g of (ii) or ii’) for 
all (a, B) € S, (a+7,£8 +7) € S, which map V onto real intervals, are given by 


f(a)=aat+b, g(S)=ch+d, a>0,c>0 
and 
f(a) = ae +b, g(B) =ceP +d, a,c,C > Oora,c,C <0, 


where a, B € V. The constants a, b, c, d, and C are arbitrary up to the positivity 
and negativity restrictions. 


Result 17.54. [39]. The general strictly increasing solutions f and g of (iii) for all 
(a, B) € S, (a +7, B) € S, which map V onto a real interval, are given by 


f(a) =aa+b,  garbitrary, a> 0, 


a € V. Under the same conditions on f and g, the general solution of (iv) for all 
(a, 2) € S, (a,B +7) €S, is given by 


f arbitrary, g(f)=cB+d, c>0, 


where BEV. 


For more details of these and other applications, see [39]. 


Symbols 


Z — the set of all integers 

Z4. — the set of all nonnegative integers 

Z*,_ — the set of all positive integers 

Z_ — the set of all negative integers 

Q — the set of all rational numbers 

Q* = Q — {0} 

R — the set of all real numbers 

Ry, — the set of all nonnegative reals (> 0) 

R* = R\{0}, R= R+ — {0} — the set of all positive real numbers 
C - the set of all complex numbers 


C* = C — {0} 
nZ={nk: keZ},neZ 
I = (0, 1] 


Io = J0, 1[, 1) = 7 or Ip 

(C) — the subgroup generated by a nonempty subset C of a group G 

f|S—a function f restricted to $ 

For subsets K, C of a field F, K+ C= {k+c:k eK andceC},K —-C= 
{k-c: ke K, cE Ch}, aK ={ak:aeF, ke K} 


1 ifx > 0, 
sgnx= 4-1 ifx <0, forx ER, 
0 ifx=0, 


-, +, 0, * are symbols of binary operations 

(f, g,h) is a homotopism means f, g, : G(o) — H (x), where G and H are 
groupoids such that f(x o y) = g(x) *h(y) 

ForC C X x X, Cy = {x € X|(x, y) € C for some y € X} 

Cy = {y € X|(x, y) € C for some x € X} 

In general, A is used for additive function, E for exponential function, L for 
logarithmic function, M for multiplicative function, B for biadditive function, D 
for derivation. 

(, ) —symbol for inner product 
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Symbols 


Min(F) — stands for all matrices with m rows and n columns over a field F, 
which could be R or C 

M,(F) — all square matrices of order n with entries from F 
det(c) — determinant of a square matrix c 

Z,y — ving of integers modulo n 

ker(f) — kernel of a mapping f 

GL, (F) — linear group of n x n matrices over a field F 

F [x] — set of all polynomials over a field or ring F 

Ty ={P = Wty P2s+.+> Pn) spe = 0, Sp Pe = 1} 

Tn = {P = (pi, p2,.--s Pn): Pe > 0, pai Pk = 1) 

A, =I, orF? 

s(x) = —xlogx — (1 — x) log(1 — x), Shannon function 
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